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Abstract 

Let X and y be points chosen uniformly at random from Zfj, the four-dimensional discrete 
torus with side length n. We show that the length of the loop-erased random walk from x 
to y is of order n^(logn)'^/^, resolving a conjecture of Benjamini and Kozma. We also show 
that the scaling limit of the uniform spanning tree on Zf^ is the Brownian continuum random 
tree of Aldous. Our proofs use the techniques developed by Peres and Revelle, who studied 
the scaling limits of the uniform spanning tree on a large class of finite graphs that includes 
the d-dimensional discrete torus for d > 5, in combination with results of Lawler concerning 
intersections of four-dimensional random walks. 



1 Introduction 

Given a finite connected graph G, a spanning tree of G is a subgraph of G which contains every 
vertex of G and has no cycles. Every connected graph G has at least one spanning tree, and a 
spanning tree picked uniformly at random from the set of all possible spanning trees is called a 
uniform spanning tree. In this paper, we study the scaling limit of the uniform spanning tree on 
the graph = {0, 1, . . . , n — 1}^, the four-dimensional discrete torus of side length n, as n ^ oo. 

Previously, Aldous [Ij studied the scaling limit of the uniform spanning tree on the complete 
graph on m vertices as m ^ oo. He showed that the limit is an object called the continuum 
random tree (CRT), which we now describe using the line-breaking construction in section 4 of 
[2]. Consider an inhomogeneous Poisson process on [0, oo) with rate r{t) = t, and denote the 
points of this Poisson process by < ti < ^2 < • • • • To construct the CRT, begin with a single 
line segment of length ti, and label the endpoints zi and Z2- Then choose a random point on this 
segment, attach to it another line segment of length t2 — ti, and label Z3 the endpoint of this new 
segment that is not on the line connecting zi and Z2. To continue the process inductively, once k 
line segments have been added to the tree, add an additional line segment of length tk+i — tfc to a 
uniformly chosen point on the existing tree, and label the endpoint of the new line segment that 
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was not on the previous tree Zk+2- We view each Hne segment as being orthogonal to all previous 
line segments, and the limiting object that we get as k ^ oo, called the CRT, is a random metric 
space. The points zi,Z2, ■ ■ ■ can be viewed as points chosen uniformly at random from a "mass 
measure" on the CRT. If d{zi,Zj) denotes the distance between the points Zi and Zj, then we 
denote by fik the joint distribution of the distances (d{zi, Zj))i<i^j<k- 

To state more precisely Aldous' result for the uniform spanning tree on the complete graph, 
let Km denote the complete graph on m vertices, and let T be a uniform spanning tree on Km- 
Fix a positive integer k, and let yi, ■ ■ ■ ,yk be vertices chosen independently and uniformly at 
random from Km- Let d{yi,yj) be the distance between yi and yj, that is, the number of vertices 
along the path in T from to yj. Then, as m — > oo, 

(1) 

V V"^ / i<i<j<k 

Recently, Peres and Revelle [15] proved a conjecture of Pitman by showing that the CRT also 
arises as the limit of the uniform spanning tree on a large class of finite graphs that includes the 
hypercubes Z^, expander graphs, and the d-dimensional torus when d > 5- Parts of their 
proof, however, do not work in dimension four. Our main result, stated below, is that the CRT 
is also the scaling limit of the uniform spanning tree on Z^. 

Theorem 1.1. Fix a positive integer k, and let xi, . . . ,Xk be points chosen independently and 
uniformly at random from the four- dimensional discrete torus Z^. Let T he a uniform spanning 
tree on Z^, and let d{xi,Xj) be the number of vertices along the path in T from xi to xj- There 
exists a sequence of constants {'yn)^=i, bounded away from zero and infinity, such that the joint 
distribution of the distances 

7nn2(logn)^/^ ' 
with 1 < i < j < k, converges to fi^ as n ^ oo. 

Note that Theorem 11.11 implies that the typical distance in the uniform spanning tree between 
two randomly chosen points in Z^ is of order n^(logn)^/^, as compared with n'*/^ on Z^ with 
d > 5- The exponent of 1/6 in the logarithmic correction was conjectured by Benjamini and 
Kozma [1]. In view of the connections between the uniform spanning tree and the loop-erased 
random walk, to be reviewed in the next subsection, and the fact that Brownian motion is the 
scaling limit of loop-erased random walk on Z^, it is natural that the CRT is the scaling limit of 
the uniform spanning tree on the torus in dimension four. We do not expect to get the CRT in 
the limit in dimensions below four. See Lawler, Schramm, and Werner p^J for a discussion of the 
uniform spanning tree in two dimensions, and Kozma [8] for recent progress on the scaling limit 
of the loop-erased random walk in three dimensions. See also [U] for results on the dynamics 
of the loop-erased random walk on the torus in dimensions four and higher which are based on 
developments in the present paper. 

1.1 Loop-erased random walks and Wilson's algorithm 

We recall here the definition of a loop-erased random walk. A path in a graph G is a sequence of 
vertices uo,ui, - - - ,Uj such that Ui-i and Ui are connected by an edge for i = 1, . . . , j. Given a 



2 



path A in G, let |A| denote the number of vertices in the path A. Also, let LE{X) denote the loop- 
erasure of A, obtained by erasing loops from the path A in chronological order. More formally, if 
A = (uq, ni, . . . , Uj), then LE{X) is the path (vq, . . . , Vk) obtained inductively as follows. First set 
vq = uq. Suppose Vq, . . . ,Vm have been defined for some m > 0. Ifvm = Uj, then k = m and Vm is 
the last vertex in the path LE{\). Otherwise, define Vm+i = ^r+ii where r = max{i : Ui = Vm}- 

A random walk on a graph G is a Markov chain X = (Xj)^q taking its values in the set 
of vertices of G such that P{Xt^i = w\Xt = v) = where we write d{v) for the 

degree of the vertex v and v ^ w whenever v and w are connected by an edge. That is, at 
each step the random walk moves to a neighboring vertex of the graph, chosen uniformly at 
random. We denote the random walk segment (Xj)^^^ by X[a, b]. Given a vertex x and a subset 
of vertices V, a loop-erased random walk from x to ^ is the path LE{X[0, T]), where Xq = x and 
T = mm{t : Xt £ V}. At times we will consider random walks that stay in their current position 
with probability 1/2 and move to a randomly chosen neighboring vertex with probability 1/2. 
However, it is clear from the definition that repeating vertices along a path does not affect the 
loop erasure of the path. 

It is well-known that there are strong connections between uniform spanning trees and loop- 
erased random walks. Pemantle [1A\ showed that if x and y are vertices in a finite graph G, then 
the loop-erased random walk from x to y has the same distribution as the path from x to y in 
a uniform spanning tree of G. Pemantle's result, in combination with Theorem ll.il immediately 
gives the following corollary. 

Corollary 1.2. Let x and y be points chosen independently and uniformly at random from Z^. 
Let X = (Xj)^Q be a random walk on with Xq = x, and let T = min{t : Xt = y}. Then there 
is a sequence of constants (7^)^!, bounded away from zero and infinity, such that for all x > 0, 

lim P(|LS(X[0,r])| >7„n2(logn)i/^x) = e'^'/^^ 



The limiting distribution in Corollary ll.2l is known as the Rayleigh distribution. It arises because 
the probability that the points zi and Z2 in the CRT are a distance at least x apart is the 
probability that there are no points in [0, x\ in an inhomogeneous Poisson process of rate r{t) = t, 
which is e~^ 

Wilson [18] established an even stronger connection between uniform spanning trees and loop- 
erased random walks by discovering an algorithm for constructing uniform spanning trees using 
loop-erased random walks. The algorithm proceeds as follows. First pick some vertex xq in the 
graph G, and let Tq = {xq}. Then given a tree Tj for some i > 0, choose a vertex Xj+i in G and 
define the tree Tj+i by adjoining to Tj the loop-erased random walk from Xj+i to Tj. Because 
loops are erased, each Tj is a tree, and if we continue the process until Tj contains every vertex of 
G, the tree Tj is a uniform spanning tree. We emphasize that the algorithm works for any choice 
of the vertices xq, xi, . . . , with the choice of Xj+i even being permitted to depend on the tree Tj. 
Our proof of Theorem 11.11 which we outline in the next subsection, will use Wilson's algorithm 
to compare uniform spanning trees on and on the complete graph Km- 

1.2 Outline of the proof 

In this subsection, we outline the main ideas in the proof of Theorem 11.11 The structure of the 
proof is borrowed from the proof of Peres and Revelle [15] for d > 5. The idea is to couple 
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a uniform spanning tree on with a uniform spanning tree on the complete graph, so that 
Theorem 1 1 . 1 1 wih follow from this coupling and Aldous' result that the CRT arises as the scaling 
limit of the uniform spanning tree on the complete graph. However, some of the steps in the 
proof involve applying results about intersections of four-dimensional random walks which are 
more intricate than the results needed to push through the proof when d > 5. 

Let xi,...,Xk be points chosen independently and uniformly at random from Z^, and let 
yi,...,yk be points chosen independently and uniformly at random from the complete graph 
Km- We can construct a uniform spanning tree on Km by using Wilson's algorithm, starting 
the first k random walks from the points yi,. . . ,yk- Note that for the purposes of studying the 
joint distribution of the distances between pairs of these points, we can stop the algorithm after 
k steps because once yi and yj are in the tree, the distance between them will not change when 
other branches are added to the tree. Likewise, we can construct a uniform spanning tree on Z^ 
by using Wilson's algorithm and starting the first k walks from xi, . . . , x^. 

To couple these two spanning trees, we break the random walks on Z^ into segments, with 
each segment corresponding to a single step of a random walk on Km- Then, for the coupling, we 
collapse the vertices in the spanning tree on Z^ that come from the same segment of a random 
walk to a single vertex. There is some flexibility in the choice of the length of these segments, 
but to obtain the best bound for the probability that the coupling succeeds, we choose the length 
of the segments to be 

r= [n2(logn)9/22j_ 

Within a segment of length r, the random walk will make many short loops that get erased 
during the construction. Long loops, in which one segment intersects the loop-erasure of a 
previous segment, can cause entire segments to get erased. 

On Km, assuming there is a self- loop at every vertex, two steps of a random walk coincide with 
probability 1/m. Therefore, for a long loop to be equally likely to form in both constructions, 
enabling the coupling to succeed, the probability that a random walk segment on Z^ of length r 
intersects the loop-erasure of another such segment must be approximately 1/m. Two random 
walks on Z^ of length r intersect with probability of order r^/(n'* log n) ~ (logn)^^/^-*^ (see 
Propositions 12.81 and \2M below), and from results of Lyons, Peres, and Schramm [13], it will 
follow that the probability that a random walk of length r intersects a loop-erased walk of length 
r is the same order of magnitude. Therefore, we need to choose m to be of order (logn)^/^^. 
More precisely, following [15], we define the capacity of a set C/ C Z^ to be 

Cap^(C/) = P{Xt G U for some t < r), (2) 

where (^j)^o ^ random walk on Z^ started at a uniformly chosen point. We will show in Propo- 
sition [3^ that the capacity of a loop-erased random walk of length r is tightly concentrated around 
its mean of a„(logn)~^/-^^, for some constant a„. Therefore, if we choose m = [a~^(log n)^/^"*^] , 
then the probability that the next segment of length r intersects the loop-erasure of a given 
previous segments is always very close to 1/m, enabling the coupling with the process on the 
complete graph to hold with high probability. Because r is much longer than the mixing time of a 
random walk on Z^, the fact that the starting point of each segment is not chosen independently 
at random does not have a large effect on these estimates. 

When the coupling of spanning trees holds, the number of vertices along the path in the 
spanning tree on Km from y^ to yj is the same as the number of segments of length r along 
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the path in the spanning tree on from Xi to Xj. To couple the distances, it is necessary to 
estimate the length of the paths of length r after loop-erasure, and to show that the distribution 
of this length is highly concentrated around its mean. It follows from a result of Lawler [11] 
that the fraction of the points on each segment that are retained after loop-erasure is of order 
(logn)~^/^, so the length of the segments after loop-erasure is of order r/(logn)^/^. We will show 
in Proposition I3.H using arguments in [9], that this length is concentrated around its mean. 

There is one further complication that must be addressed. To carry out Wilson's algorithm 
on Z^, it is necessary to start the tree with an initial vertex. Then we start a random walk from 
xi and run the walk until it hits that vertex. However, the time before a random walk on 
hits a given vertex is of order n^, much longer than the length of a loop-erased random walk 
between two points. To avoid having to run a random walk for a time of order n^, we again use a 
technique of Peres and Revelle |15j by adding a root vertex to each graph. Given a > 0, let Km,a 
be a weighted graph constructed by starting with Km, giving every edge of Km (including the 
self- loop at every vertex) weight 1, and then adding a root vertex p that is connected to every 
other vertex in the graph by an edge of weight mlia^fm — 1). This means a weighted random 
walk on Km,,a that on each step moves to a neighboring vertex with probabilities proportional 
to the edge weights is the same as the simple random walk on Km, except that it jumps to p 
after a number of steps which is geometrically distributed with mean a^/m. Likewise, let G„^^ be 
obtained by adding to Z^ a root vertex p, which is connected to every other vertex in the graph 
by an edge of weight 8/(/3n^(log n)^/^ — 1). The weighted random walk on Z^ is the same as the 
simple random walk on Z^, until it jumps to p after a number of steps which is geometrically 
distributed with mean /3n^(log n)^/^. Given /?, we choose a so that 



a^/m \ /3n2(logn)i/2 

This choice ensures that the probability that a single step of a random walk on Km,a visits the 
root is the same as the probability that an r-step random walk on Gn^p visits the root. It is 
then possible to couple uniform spanning trees on Km,a and Gn,p by starting the tree with the 
root vertex, so that the initial random walks started from y\ and x\ respectively are run until 
they hit the root. Once we have uniform spanning trees on Km,a and Gn,/3) we can remove the 
edges leading to the root to get spanning forests on Km and Z^. As /3 ^ oo, the probability 
that x\,...^Xn are in the same tree component of the spanning forest tends to one, so we can 
use results on stochastic domination of spanning forests by spanning trees, as in [15j, to couple 
the spanning trees on Km and Z^. 

We conclude this discussion by observing that, with the above picture in mind, there is a 
simple explanation for why the exponent 1/6 arises in the logarithmic correction. If we run two 
random walks started from x\ and x^ for time L > nlogn, then they intersect with probability 
of order min{l, L'^ /{n^ logn)}, which is of order one when L = n^(logn)^/^. Therefore, the path 
in the uniform spanning tree between xi and X2 comes from random walk paths whose lengths 
are of order n^(logn)^/^. The fraction of points along these paths that survive loop-erasure is of 
order (logn)"^/'^, so the lengths of the remaining paths are of order n^(logn)^/^. 

The rest of this paper is organized as follows. In section 2, we prove some results about 
random walks on the four-dimensional torus that will be needed to study loop-erased random 
walks. In section 3, we study loop-erased walks on the torus, focusing on obtaining tight bounds 
on the length and capacity of a loop-erased segment. Then in section 4, we use these results to 
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couple a uniform spanning tree on with a uniform spanning tree on and complete the 
proof of Theorem 11.11 



2 Random walks on the four-dimensional torus 

In this section, we establish some facts about random walks on that we will need to study the 
loop-erased random walk and the uniform spanning tree. Most of the key ideas come from the 
book of Lawler [9], which contains numerous results for random walks on Z^. The work in this 
section primarily involves establishing the analogous results for random walks on the torus. 

We denote the point (0, 0, 0, 0) by 0, and if x = (xi, . . . , X4) G Z^, we denote the Euclidean 
norm by \x\ = {x\ + • • • + It will be convenient to work with aperiodic random walks. 

Therefore, when we say that X = (Xt)^Q is a random walk on Z^, we will assume that at each 
step, the random walk stays in its current position with probability 1/2. That is, for all t, we 
have P{Xt+i = Xt\Xt) = 1/2 and, if x is one of the 8 points in Z^ such that = 1, then 
= Xt + x\Xt) = 1/16. To apply results of Lawler [9] about random walks on z^ it 
will be necessary to consider also random walks on Z^ that never stay in their current position. 
Therefore, if Z = {Zt)^Q is a random walk on Z^, we will specify that Z is a simple random 
walk if = Zt\Zt) = and a lazy random walk if = Zt\Zt) = 1/2. In both cases, 

when the random walk does not stay in its current position, it moves to one of the 8 neighboring 
points with equal probability. All of our random walks will be in discrete time. 

Throughout the paper, C, C , Ci, C2, . . . will denote positive constants that do not depend on 
n but whose values may change from line to line. 



2.1 Bounds on transition probabilities 



In this subsection, we will establish some bounds on transition probabilities for random walks on 
Zf^. Let X = {Xt 

be a random walk on Z^. Let Y — (^4)^0 ^"6 a simple random walk on 



Z , and let Z 



\oo 

{ZtT 



t=0 



be a lazy random walk on Z . For all x, y G Z„, let pt,n{x, y) = P{Xt 



y\Xo = x). For ah x, y G Z , let ^^(x, y) = P(Yt = y\Yo = x) and let qt{x, y) = P{Zt = y\Zo = x). 
The following result for the simple random walk, which is a version of the Local Central Limit 
Theorem, is the four-dimensional case of Theorem 1.2.1 in [9]. 



Lemma 2.1. There exists a constant C such that for a// 1 G N and x = (xi, , 
that t + xi + • • • + X4 is even, we have 



,X4) G 



such 



-2\x\^/t 



IT- 



2*2 



< C min < — 



1 



1 



t3't2| 



The following large deviations result, known as Azuma's Inequality, will be useful for deducing 
properties of the lazy random walk from properties of the simple random walk. If (Aft)^Q is a 
martingale with Mq = and iM^+i — Mt\ < c for all t, then 

P{Mt > x) < e~^''/2^'*. (4) 

See section 2.4 of [7] for a proof. Azuma's Inequality and Lemma [2. II lead to the following bounds 
for transition probabilities of the lazy random walk. 
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Lemma 2.2. There exists a constant Ci such that 

qt{0,0)<^ (5) 

for all t e N. There also exists a constant C2 such that for allt^'H and all x £ Z^, we have 

gt(0,x)<|%. (6) 

Proof. It follows from the 2; = case of Lemma 12.11 that there exists a constant C such that 
?t(0, 0) < Ct~^ for all t G N. To obtain the bound for the lazy random walk, we condition 
on the number Nt of steps s < t such that Zg ^ Zg-i- If Mt = t/2 — Nt, then (Mj)^q is a 
martingale with \Mt+i - Mt\ < 1/2 for all t. Therefore P{Nt < t/i) < e"*/^ hy It follows 
that qt{0,0) < C(t/4)-2 + 6"*/^, which implies (l5|). 

To prove ([6]), note that if a > 0, the function t 1— > i~^e~^"/*, defined for t £ (0,cxd), is 
maximized when t = a. Therefore, t~^e~^l^l < e~^|x|~^ for all t G N and x G Z^. Since 
q'i.{0,x) = when t < \x\ and < when t > we get gj(0,x) < C|x|~^. Because 

this bound does not depend on t, we get the same bound for qt{0,x) by conditioning on Nt. □ 

The next step is to use these bounds on the transition probabilities for the random walk on 
Z*^ to obtain bounds on the transition probabilities for the random walk on the torus Z^. Note 
that the random walks X and Z can be coupled so that Xf = Zt (mod n) for all t. 

Proposition 2.3. Suppose x = (xi, . . . , X4) is a point in Z^ such that \xi\ < n/2 for i = 1, . . . , 4. 

Let x' = {x'l, . . . ,x'^) G Z^ be defined so that x[ = Xj (mod n) for i = 1, ... ,4. Then for any 
constant C , there is a constant C such that for all t < C'n^ , we have 

qtix, 0) < pt,n(x', 0) < qtix, 0) + (7) 



Proof. Let V = (nZ)^. It is clear from the coupling between X and Z that 

pt,n{x',0) = '^qt{x,y). (8) 

The first inequality in ([7]) follows immediately. To obtain the second inequality, we divide Z^ 
into boxes. Given y = {yi, . . . , y^) G Z^, let 

B{y) = {{zi, . . . ,Zi) ■.\yi - Zi\ < n/2 for i = 1, . . . ,4}. 

Denote by dB{y) the set of all points in B{y) having a neighbor that is not in B{y). 

Assume that Zq = x, so qt{x,y) = P{Zt = y). Suppose y G F \ {0}. If Zt = y, then 
Zs G dB{y) for some s <t. Because \y — z\> {n — l)/2 for all z G dB{y), equation Q implies 
there is a constant C such that for all s < t and all z G dB{y) we have P{Zt = y\Zs = z) < Cn~^. 
By applying the strong Markov property at the stopping time min{t : Zt G B{y)}, it follows that 
qt{x,y) < Cn~'^P[Zs G B[y) for some s < t). Therefore, it remains to show that there is a 
constant C such that 

P{Zt G B{y) for some s<t)<C. 

yev\{0} 
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For k > 1, the cardinaUty of 

Vk = <y = [yi, ■ ■ ■ ,yA) ^ V : — < max \xi - yi\ < 

is at most (A; + 1)^. For y £ V^, there is an i G {1, ... ,4} such that \xi — Zi\ > (k — l)n/2 for 
ah z = (zi, . . . , Zi) G B{y). Let Zi^t be the ith coordinate of Zt, and note that {Zi^t)^o is a 
martingale. By applying the Reflection Principle followed by (jH) to {Zi^t)^o, we get 

P{Zt G B{y) for some s < t) < 2P{\Zi^t\ > (k - l)n/2) < 4e-{'^-i)'"V8t^ 

and the right-hand side is at most 4e~^^~^^'^^^'^' whenever t < C'-n? . It follows that 

oo 

P{Zt e B{y) for some s < t) < 4(A: + l)4e-(fc-i)V8C' < 
y6y\{o} k=i 

which implies the proposition. □ 

Corollary 2.4. For any constant C , there exists a constant C such that Pt,n(0,0) < C/t'^ for 
all t such that 1 < t < C'n'^. 

Proof. Since < (C'/t)^, the result follows immediately from ([5]) and the upper bound in 

Proposition I2.3[ □ 

Proposition [2?3] and Corollarv l2.4l applv only when t < C'v? . To bound transition probabilities 
when t > C'v?, we will need facts about the mixing time of the random walk X, which are 
discussed in the next subsection. 

2.2 Mixing time 

The random walk X is an irreducible, aperiodic Markov chain whose stationary distribution is 
uniform on Z^. Therefore (see, for example, chapter 5 of [5]), the distribution of Xt converges 
as t — > oo to the uniform distribution on Z^, which we denote by vr. To measure the speed of 
convergence, we define the uniform mixing time 



Tn = min < t : sup 



Pt,n{0,x) - 



rC* 



1 

- 2^ 



For any two probability measures /i and v on Z^, define 

d{^,u) = sup n'^\fi{x) — i'{x)\, 

so Tn = min{t : d{pt^n{0, O;^) ^ I/^}- A standard subadditivity result on convergence for 
Markov chains (see, for example, the p = oo case of Proposition 2.2 in [16]) implies that if 
d{pt,n{^, O'^) ^ P and s>t, then (i(ps,n(0, O'^) ^ Z?'-'*^*-'. In particular, for all t > Tn, we have 



sup 

x&Zt 



, , 1 
Pt,n(0,x) - ^ 



<n-^2-L*/rnJ^ (9) 



8 



It is well-known that there exist constants < Ci < C2 < oo such that 

Cin^ < r„ < Csn^ for all n. (10) 

See, for example, the calculations in chapter 5 of [3]. The notion of mixing time will be useful in 
what follows primarily because of the following lemma. 

Lemma 2.5. Let f be a nonnegative function of two paths in 1^ of lengths L + 1 and M + 1. 
Suppose X = (Xj)^Q and Y = (li)^o ^'^^ independent random walks on Z^, and suppose Yq has 
the uniform distribution on Z^. If d > Tn, then 

E[f{X% L],X[L + d,L + M + d])] < lE[f{X[0, L],Y[0, M])]. 

Proof. Let x = (xq, • • • ,xl) be a path in of length L + 1, and let y = (yo, • • • , Um) be a path 
in Z^ of length M + 1. By the definition of r„, 

3 3 

P{XL+d = yo\X[{),L\ =x) < maxprf,„(x,2;) = maxpd.n(0,z) < — ^ = -P{Yq = yQ\X[{),L\ = x). 

Therefore, by the Markov property, 

P(X[0, L] = X, X\L + d,L + M + d]=y) 

= P{X[0, L] = x)P{X[L + d,L + M + d]= y\X[0, L] = x) 

< ^P{X[0, L] = x)P{Y[0, M] = y\X[0, L] = x) 

= ^P{X[0,L]=x,Y[0,M]=y), 
and the result follows easily. □ 
2.3 Intersections of two walks 

Our goal in this subsection is to prove upper and lower bounds for the probability that two random 
walks on the torus intersect. Our first lemma concerns the expected number of intersections of 
two random walks started at the origin. 

Lemma 2.6. Let {Xt)'^^ and (i^)^o ^^'^ independent random walks on Z^ started at the 
origin. Let 

L L 
s=0 t=0 

be the number of intersections. If L < C'n^{logn)^^'^ for some constant C' , then there exists a 
constant C such that E[R\ < Clogn. 

Proof Note that P{Xs = Yt) = P{Xs+t = 0). It follows from ^ and Corollary [23] that there 
is a constant C such that P{Xt = 0) < C/t^ if 1 < t < r„. If t > r„, then P{Xt = 0) < 3/2n^. 
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Therefore, 



L L 



s=0 t=0 



t=Q 



3(L + 1) 
2n4 



+ 3(L + 1)2 



i2 



+ 



2n4 



< C logr„ + ^ , 



which gives the result. 



□ 



For all X G Z^, denote by Gl{x) the expected number of times that a random walk of length 
L started at visits x. That is, if (Xt)^Q is a random walk on started at 0, then 

L 

GL{x) = Y,nxt = x). 

t=0 

Likewise, for x E Z^, let G*j^{x) be the expected number of times that a simple random walk on 
of length L started at visits x. Let G*{x) be the expected number of times that a simple 
random walk on of infinite length started at visits a;, so G* is the standard Green's function 
for random walks on Z''. 

Lemma 2.7. Let X = (Xt)^o be a random walk on Z^ started at the origin. Let 

n 

Dn = Y,Gn{Xt). 

t=0 

There exist constants Ci and C2 such that P{Dn < Ci logn) < C2/logn for all n. 

Proof. Let Z = {Zt)^Q be a lazy random walk on Z"^, coupled with X so that Xt = Zt (mod n) 
for all t. Let Z' = {Z^)'^^ be another lazy random walk on Z^ started at 0, independent of Z, 
and let X' = (X^)^q be the corresponding random walk on Z^ such that X'^. = Z'^. (mod n) for 
all t. Since Xt = X' whenever Zt = Z' we have 



Dn = E 



t=0 s=0 



{Xt=X',} 



X 



> E 



t=0 s=0 



{Zt=Z',} 



(11) 



Let Tq = Tq = 0, and for /c > 1, let = min{t > Tfe_i : Zt 7^ Zxf._^} and T^, = min{t > T'^^_i '■ 
Z't i-ZL }. Then let Yt = Zt^ and = ZL for ah t > 0, so Y = (^4)^0 and Y' = (F/)^^o are 

simple random walks on Z^ started at 0. 
Let 

Ln/4J 

£>* = ^ G*(Yt). 



Lawler showed that there exist constants C3 and C4 such that 

C4 



P(D* < Calogn) < 



logn 



(12) 
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(see p. 101 of ^J, or use Proposition 3.4.1 of [9j in combination with Chebyshev's Inequality). 
Note that g^(0,0) < C/s^ for all s > 1 by Lemma EH Therefore, 



E 



Ln/4J 



Ln/4J 



^ {G*{Yt) - G]^„/4j(l^*)) =Y. E ^(^* = ^s) < (K4J +1) E 9^(0' 0) ^ ^- 

-I t=0 s=[n/4j+l s=[n/4j+l 



t=0 

It follows that if 



Ln/4J Lri/4J Ln/4J 

= E GW4j(^*) = ^ E E Mys=yn 

i=0 L t=0 s=0 



Y 



(13) 



then P{D* — D* > (C3 log n) /2) < C/(log n) by Markov's Inequality. Combining this observation 
with (fT2]) . we get 



(14) 

2 y - logn ^ ^ 

Azuma's Inequality (HD gives 

^(^L«/4J > ™) ^ e""^® and PiT^^/^i > n) < e-"/^. Therefore, 
by ([II]) and ([l3]), we have Dn> D*-{1 + n/4)2e""/^ on the event that rL„/4j < n. The lemma 
now follows from (|14|) . □ 

Our main result concerning intersection probabilities of two random walks on the torus is 
the following. We prove it by using essentially the same proof used by Lawler to bound the 
intersection probability of two random walks in (see the proof of the upper bound of Theorem 
3.3.2 in [9]). 

Proposition 2.8. Let (Xt)^Q and {Yt)'^Q be independent random walks on Z^. Assume that Yq 
has the uniform distribution on Z^. Also, assume there is a constant C such that K > C'nlogn 
and L > C'nlogn. Then there is a constant C such that 



P{Xs = Yt for some s < K and t < L) < 



CKL 
log n 



Proof. Set K = min{t > : Xf = Yg for some s < L}, and let a = min{s > : = Yg}. We 
need to bound P{k < K). Let 

J=[s: J2Gn{Yt+s - y.) < Cilognj, 

where Ci is the constant from Lemma 12.71 For all t = 0, 1, . . . , X and s = 0, 1, . . . , L, we have 
P{Xt = Ys) = 1/n^ because Yg has the stationary distribution tt. Also, the event that s S J is 
independent of the event that Xt = Ys. By Lemma [2.71 we have P{s G J) < C2/(logn) whenever 
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Also, 



Let 



s < L — n. Putting these results together, we get 

L 

P{K<K,a£j) = ^P{K<K,a = s,s£ J) 

K L-n K L 

t=0 s=0 t=0 s=L-n+l 

K L—n ^ K L ^ 

t=0 s=0 ^ t=0 s=L-n+l 

^ C2{K + l)L K + l 

< 4i \ 3—- (15) 

log n n-^ 

K L (T 
P{K-n<K<K)< E^(-^* = ^-)= J • (16) 

t=K-n+l s=0 
K L 

t=0 s=0 

By the definition of J, we can apply the strong Markov property to {Xt)j^Q while holding the 
path (It)t^o fixed to get 

E[R\k < K -n,a J]> Cilogn. 

It follows that 

P(. ^ ^ . 1 f ^ . >,) . (™±il. (17) 

Cilogn Cilogn'^^'^^ Cin^logn 

The result now follows from (jlSp . (jl6p . and (|17p . □ 

The next result gives a lower bound for the intersection probability when the two walks have 
the same length. 

Proposition 2.9. Let X = {Xt)'^Q and Y = (It)^o independent random walks on Z^. 
Assume that Yq has the uniform distribution on Z^. // there is a constant C such that L < 
C'n^ (log n)^/^, then there is a constant C such that 

P{Xs = Yf for some s < L and t < L) > 



log n 



In the proof of this proposition, as well as later in the paper, we will use the following well- 
known result (see, for example, (3.5) on p. 95 of [9]), which gives a lower bound on the probability 
that a nonnegative random variable is strictly positive. 

Lemma 2.10. Let Z be a nonnegative random variable. Then P{Z > 0) > {E[Z])'^ / E[Z'^]. 
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Proof of Proposition \2.9l Let 

L L 

Since Yq has the uniform distribution, P{Xg = Yt) = for ah s and t, so 

Also, if s < n, then by the Markov property and time reversibihty of Y, we have 

P{Xu = Y,\Xs = Yt) = P{Xu-s = Y\,_t\\Xo = Yo). 



Thus, 



EIR^] = E 



L L L L n 

5^ Mx.s=Yt}Mx^=Y^} 
s=0 t=0 u=0 v=0 ^ 
L L L L 

<'^^J2J2Y1 = "^t)P{Xu = X,\Xs = Yt] 

s=0 t=0 u=s v=0 
L L L L 

s=0 t=0 u=s v=0 



^EEEE:^.P(^^ = ^r\Xo = Yo) 

s=0 t=0 g=0 r=0 
q=0 r=0 



where the last inequality follows from Lemma 12.61 By Lemma 12.101 



E[R^] - nMogn' 

and the result follows. □ 



2.4 Self-intersections 

We now use Proposition 12.81 to bound the probability that a random walk on intersects its 
previous path. These bounds will be important for the study of the loop-erased random walk 
on Z^. Proposition 12.111 below shows that random walks whose length is much shorter than 
n^(logn)^/^ are unlikely to make loops whose length is much larger than n?. 

Proposition 2.11. Let (Xt)^Q be a random walk on Z^. There exists a constant C such that 

P{Xt = Xs for some s < L and t < L such that t > s + 2r„) < — j- . 

n'* log n 
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Proof. We may assume L > 2r„. Let m = \{L + l)/rn], and for i = l,...,m, let Bi = 
{{i — 1)t„, . . . , iTn — 1}. If s + 2r„ < t < L, then there exist Bi and Bj with j > i + 2 such that 
s £ Bi and t £ Bj. If j > i + 2, then by Lemma 12.51 and Proposition 12.81 



3Cr 

P{Xt = Xs for some s € -Bj and t £ Bj) < 



2n^ log n ' 



where C is the constant from Proposition I2.8[ Since the number of pairs of segments Bi and Bj 
is at most m?, it follows that 

/L + 1 \^ 3Cr^ 
P{Xt = Xg for some s and t such that t > s + 2r„) < I hi 



2n^ log n ' 

which gives the result. □ 

Our next result uses the coupling introduced in subsection 12. II between a random walk on 
and a random walk on Z^. We show that a random walk on whose length is much shorter than 
n-^(logn)^/^ is unlikely to have loops that do not also appear in the corresponding walk on Z"^. 
This is closely related to Proposition 12.111 because typically short loops are present in random 
walks on Z^ while long loops occur only on the torus. 

Proposition 2.12. Let (Xt)^Q be a random walk on Z^. Let {Zt)'^Q be a lazy random walk on 
Z^. Assume the random walks are coupled so that Xt = Zt (mod n) for all t. There is a constant 
C such that 

CL"^ 

PiXg = Xt and Zg / Zt for some s < L,t < L) < . 

log n 



Proof. The proof is similar to the proof of Proposition 12.81 Define k = min{t > : Xg = 
Xt and Zs ^ Zt for some s < t}, so k is the first time that there is a loop in the walk on Z^ that 
does not correspond to a loop in the walk on Z^. Let a = min{s > : Xg = X^ and Zg ^ Z,^}. 
We need to bound P(k < L). Note that if Xg = Xt and t — s < n, then Zg = Zt. Therefore, if 
K < L, then a < k — n. Define 

ln/3\ 

■J=\'-Y1 GL„/3j(Xt+,-X,)<Cilog(Ln/3j) , 

where Ci is the constant from Lemma 12.71 

Proposition 12.31 implies that that there is a constant C such that pt^„(0, 0) — gj(0, 0) < C/n'^ 
whenever 1 < t < t„, while ([9|) and the definition of r„ imply that if t > r„, then we have 
Pt,n{0, 0) — qt{0, 0) < Pt,n{0, 0) < 3/2n^. Therefore, there is a constant C such that if s < t, then 

PiXg = Xt and Zg / Zt)=pt-g,niO,0) - gt-.(0,0) < (18) 

It follows that 

L L 

P{L - Ln/3J < K < L) < ^(^^ = + ^ (19) 

s=0 t=max{s+l,L-[n/3j} 
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Let 

L L 
s=0 t=s+l 

By (HI]), there is a constant C such that E{B\ < CL?/n^. Note that if Xs = Xt and Zs ^ Zt, 
then if X^+u = XtJ^^ for some u and v such that + f < n, we have Z^+u 7^ -^t+tj- It follows by 
applying the strong Markov property at time k that 

E[R\k <L - [n/3\,a i J]> Ci log([n/3j). 

Therefore, 

P{k<L- Ln/3J ,aiJ)< ^ .^jf < (20) 

Cilog([n/3J) n^logn 

By Lemma 12.71 there is a constant C such that P{s G J) < C/(logn) for all s < L — n/3. 
Therefore, 

L—n L 



P{k < L,a G J) = Y^ P{cr = s,K = t,s £ J) 

s=0 t=s+n 
L~n L 



s=0 t=s+n 
„ L—n L 

The next step is to use Proposition 12.31 to bound P{Xs = Xt,Zs 7^ Zt\s G J). The key idea 
is that the event {s £ J} depends only on the random variables Zg, Zs+i, ■ ■ ■ , Z^j^y^m , and so 
conditioning on this event does not have a large effect on the probability that Xg = Xt and 
Zs 7^ Zf. Let B = {x = (xi, . . . ,X4) G : \xi\ < [n/3\ for i = 1, . . . ,4}. Given x G Z^, let 
x' G be defined so that x' = x (mod n). Therefore, using the Markov property at the time 
s + [n/3j , we get that for t > s + n, 

P{Xs = Xt, Zs ^ Zt\s G J) < max max P{Xs = Xt, Zg / Zt\s e J,Zs = x, ^s+|n/3| = u) 

= max max P{Xs = Xt, Zs / Zt\Zs = x, ^s+|„/3| = y) 

= max (pt^^^L„/3j,n(2',0) - qt-s-[n/3\{z,0)) . (22) 

By Proposition 12.31 there is a constant C such that Pt-s-[n/3\,n{^' ~ 'J't-s-[n/3j (-2, 0) < C/n^ 
whenever z ^ B and t — s — \n/3\ < Tn- If t — s — [ri/3j > t„ then 

Pt-s-Ln/3j,n(^',0) - qt-s-[n/3\iz,^) < Pt~s~[n/3\,n{^' ^ 3/2n''. 



Combining these bounds with ()21l) and (|22p . we get 

P(A^<L,aG J) < . (23) 

log n 

The proposition follows from (jl9p . (j20p . and ()23p when L > nlogn. The result follows from 
Azuma's Inequality ^ when L < n log n because in this case it takes longer than time L for the 
random walk to move a distance n. □ 
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2.5 Local cutpoints 

Let (Xt) be a random walk on Z^. Using terminology similar to that of Peres and Revelle 
[15] . we call the time u a /c-local outpoint if 

{^u-k, ■ ■ ■ , -'^M-i} n {Xu+i, . . . , Xu+k} = 0- 

In other words, the portion of the random walk between times u — k and n — 1 does not intersect 
the portion between times n + 1 and u + k. We call a 2rn-local cutpoint simply a local cutpoint. 
Our goal in this subsection is to prove that for random walks on Z^, with high probability 
every sufficiently long segment contains at least one local cutpoint. More precisely, we have the 
following proposition. 

Proposition 2.13. Let C > be a constant. Let {Xt)^^Q be a random walk on Z^, and suppose 
L < C"n^(log n)"*^/^. Then for all 6 > 0, there is a constant C such that with probability at least 
1 — C/(logn), every interval [t, t + n^(log n)^] with < t < L — n^(logn)^ contains a local cutpoint. 

Before proving Proposition 12.131 we prove two lemmas. The first gives an estimate on the 
probability that a given point is a local cutpoint. 

Lemma 2.14. Let X = {Xt)^Q and X' = {X^)'^^ be independent random walks on Z^ starting 
at 0. For all m, let fn{m) be the probability that the two walks do not intersect through time m, 
so 

fnipi) = P{Xs / X[ whenever 1 < s < m,l < t < m). 

Then 

fn{2Tn) 1 



lim 



fn[n) 



Proof. Let Z = (Zt)^Q and Z' = (Z^)^q be two lazy random walks on Z^ starting at 0. For all 
m, let 

g{m) = P{Zs 7^ Z[ whenever l<s<m, l<t< m). 
Lawler |10j showed that there is a constant Co such that 

~ n ^°u/2 (24) 
(logmj^/^ 

where ~ means that the ratio of the two sides tends to 1 as m ^ oo. Lawler's estimate was stated 
for simple random walks, rather than lazy random walks. However, ^ implies that the number 
of steps on which a lazy random walk of length m does not stay in the same place is between 
m/4 and m with probability at least 1 — e"™'/^, so it is easy to see that Lawler's estimate also 
holds for the lazy random walk. 
We now claim that 

lim = lim = 1. (25) 



+ 00 



We may assume that the random walks are coupled so that Xt = Zt (mod n) and X[ = Z[ (mod 
n) for all t. Fix m < 2r„. Define a random walk (l^)^™o concatenating the walks Z and 

Z' . More precisely, define Yt = Z'^_t — Z'^ for < t < m and Yt = Zt~m — Z'^ for m+1 < t < 2m. 
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If Zg = Z[ for some 1 < s < m and 1 < t < m, then Ym-t = ^+s- Now define a random walk 
^t)\=Q on such that Wt = Vt (mod n) for < t < 2m. If = X[ but / Z^' for some 
1 < s < m and 1 < t < m, then y^.^ / Ym+s but y„_t = y„+s (mod n), so W^-t = Wm+s- 
Since m < 2t„, it follows from (jlOp and Proposition 12.121 that 

Cr^ C 

Piy/^ = Wt and / y* for some s < t < 2m) < " < 



log n log n 

Therefore, < g{m) — fn{^) ^ C/i^ogn) for all m < 2Tn- In view of (j24p . equation (j25p follows. 
Using dMl), (125]), and (dUD, we get 



n^oo Jn{n) n->oo g(n) n^oo (logfl)"^/^ ^2 

as claimed. □ 

Lemma 2.15. Let {Xt)'^Q be a random walk on Z^. There exists a constant b > such that if 
L < 5r„, the probability that X[0,L] has a local cutpoint in [2r„,L — 2r„] is at least b. 

Proof. Choose points xi, . . . ,Xk G [2r„, L — 2r„] such that Xj+i — Xi > 2n for i = 1, . . . , A; — 1. 
Since L > 5r„, this can be done so that k > Cn for some positive constant C. Let Ai be the 
event that Xi is an n-local cutpoint. Since Xj+i — Xi > 2n, the events Ai, . . . , are independent, 
and each has probability fn{n). Let Y be the number of the points xi, . . . that are n-local 
cutpoints. Then Y has the binomial distribution with parameters k and fn{n). Let Z be the 
number of the points xi, . . . , that are local cutpoints. Then Z <Y . Since 

P{x is a local cutpoint|x is an n-local cutpoint) = ^"^^ 



we have, for < a < 1/a/2, 



/n(n) 



for sufficiently large n by Lemma 12.141 Using Lemma l2.10( we have, for sufficiently large n, 

P(7 > m > ^-^t^])' > "'(-^[^])' = «^(fc/n(n))^ 

^ ^ - - A:/„(n)(l-/„(n)) + (fc/„(n))2- ^ ^ 

Note that fc/„(n) — > cxd as n ^ oo by ()24p . (j25p . and the fact that k > Cn. Thus, the right-hand 
side of ()26p approaches > as n ^ cxd, and the lemma follows. □ 

Proof of Proposition [2. 131 Divide [0,L] into disjoint subintervals of length |n^(logn)^, throwing 
away any remainder at the end of the interval. If each of these subintervals has a local cutpoint, 
then so does every interval of length n'^(log n)^ . Therefore, it suffices to show that with probability 
at least 1 — C/(log n), each subinterval has a local cutpoint. Note that the number of subintervals 
is at most 2C"(log n)^/^-^ 

Now split an interval / of length ^n^(logn)^ into disjoint subintervals Ji, . . . , of length 
5Tn, again throwing away any remainder. By (llOp . m > Ci(logn)^ for some constant Ci > 0. 
For j = 1, . . . ,m, let Aj be the event that there is a local cutpoint in the segment of length r„ 
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in the middle of Jj. Since Aj depends only on the random walk during times in Jj, the events 
^1,..., 

are independent. By Lemma 12.151 P{Aj) > b for all j. Thus, the probability that 
I does not have a local outpoint is at most (1 — 6)"^ < (1 — Since there were at 

most 2C"(log n)^/^~^ intervals of length n^(logn)^, the probability that all of them have a local 
outpoint is at least 

l-2C'(logn)i/2-^(l-6fi(i°g")', 
which completes the proof. □ 

3 Properties of loop-erased segments 

In this section, we will consider a segment of a loop-erased random walk on the torus Z^. Our 
main goals are to show that the length and the capacity are highly concentrated around their 
means. As noted in the introduction, we will study the uniform spanning tree on by breaking 
up the random walks used in Wilson's algorithm into segments of length r = [n^(log n)^/^^J . 
Therefore, in this section we will be primarily concerned with random walks whose length is of 
order n^(logn)^/^^. 

3.1 Length of a loop-erased segment 

In this subsection, we show that the length of a segment of a loop-erased random walk on Z^ is 
highly concentrated around its mean. The argument relies heavily on results of Lawler [9j for the 
length of a loop-erased random walk on Z^. 

Proposition 3.1. Suppose X = (Xt)^Q is a random walk on Z^. Then there exist constants C 
and C and a sequence (cfc)^-^ with < inf Cfc < supcfc < cxd such that for all L, 



P 



|L^(X[0,L])| ^""^ 



(log L) 1/3 



> ^'^ ] ^^/ loglogL ^ L2 



(logL)3/4y- V(log^)^/^ n^logn 



By applying this proposition when L is of the order n^(log n)^/^^, we immediately get the 
following corollary. 

Corollary 3.2. Let X = (Xt)^Q be a random walk on Z^. Assume there are constants Ci and 
C2 such that Cin^(log n)^/^^ < L < C2n^(log n)^/^^. Then there exist constants C and C and a 
sequence ''i'ith < inf6„ < sup6„ < 00, all possibly depending on the values of L, such 

that for all n, 



|LS(X[0,L])| -6„n2(logn)5/66 



C'n' \ C 



^ (log n) 15/44 J - (logn)4/22' 



Proof of Proposition \3.1[ We may assume that X is coupled with a lazy random walk Z = {Zt 



't)t=0 

on Z^ such that Xt = Zt (mod n) for all t. Unless there exist two times s and t such that Xg = Xt 
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and 7^ Z^, we have |Li?(X[0, -L])| = |L£'(Z[0, because the two random walks make loops 
at the same times. Therefore, by Proposition 12.12] 



P{\LE{X%L\)\ + \LE{Z%L\)\) < 



CL'' 



log n ' 



(27) 



(yi)^o such a 



so we may focus on controlling |L£'(Z[0, -L])|. 

The lazy random walk Z can be coupled with a simple random walk Y 
way that if Tq = and = min{t > Tfe_i : Zf ^ Zx^_^} for k > 1, then Yf = Z^t for all t. Let 
M be the cardinality of {t : 1 < t < L and Xt ^ Xt-i}, which is independent of Y[0, L] and has 
the binomial distribution with parameters L and 1/2. Then 



\LE{Z[0,L])\ = \LE{Y[0,M])\. 



(28) 



It will at times be convenient to condition on M 
ities and expectations by Pm and Em- 
Let fJo = max{t : Yt = 0}, and for i > I, let cTj 
(To,cTi, . . . are said to be retained after loop-erasure. Let 1^ be the indicator of the event that 
k = Gi for some i. Let = E[Ik\ be the probability that the time k is retained after loop-erasure. 
Lawler |llj showed that there are constants Ci and C2 such that 



m, and we denote these conditional probabil- 
max{t > (Tj_i : Yt = yo-j_i+i}- The times 



(log/fc)V3 



(log A;) 1/3 



(29) 



for ah k>2. 

Now YlT=i is the number of points, out of the first m, that are retained after loop-erasure 
of the infinite walk. Call a point t loop-free if, whenever < s < t and u > t, we have Yg ^ Y^. 
By Lemma 7.7.4 of [9], there is a constant C such that for all m, the probability that there is 
a loop-free point between m — m{logm)~^ and m is at least 1 — C(loglogm)/(logm). On this 
event, no points of the walk before time m — m(logm)^^ can be erased after time m. Therefore, 



Y,h-\LEiY[0,m] 

k=l 



> 



m 



(log m)^ 



< 



C log log m 
log m 



(30) 



We now consider a finite loop-erasing procedure also considered in [5]. Choose = jo < 
< ... < = M such that M/2(logM)2 < ji - < 2M/{\ogMf for i = 1, . . . , w. If 
ji-i < k < ji, or \i i = 1 and A; = 0, let I'^ be the indicator of the event that the time k is retained 
after loop-erasure of the segment jj]. Lawler shows (see p. 207 of [9]) that 



Er, 



fc=0 



fc=0 



< 



Cm log logm 
log m 



(31) 



Lawler defines the times ji slightly differently, but it is easy to see that this does not affect the 
proof of (j3ip . Therefore, by Markov's Inequality, 



E^^-E4 



fc=0 



k=0 



> 



m 



C log log m 
(logm)3/^y ~ (logm)i/'i 



< 



(32) 
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Lawler also shows (see p. 208 of [Hj) that 



fc=0 



■ fc=0 



> 



(log m)^/2 



Err 



k=0 



< 



c 



(log m)^/^ 



(33) 



Since E[J2T=iIk] = EfcLi Ofc < Cm/{logm)^/^ by ([291), we can combine <^ with 1^ to get 
that there are constants C and C" such that for all m, 



fc=0 



■ fc=0 



> 



Cm 



< 



C 



(logm)^/^/ (logm)'^/^ 



(34) 



By ([5n|) . ([52]) . and there are constants C and C such that 



p 

J 77 



|Li?((y,)Ilo)l-^« 



fc=0 



> 



Cm \ C log log m 
(logm)3/4 ) - (logm)i/4 ■ 



By 



Therefore, 



We have 



P{\M - L/2\ > L^'^) < 2e~2^. 

M [L/2i 



(ivi L-"/ 
A:=0 fc=0 



> 1 + < 2e-2v^. 



(35) 

(36) 
(37) 



LL/2J 

|Lii;(y[0,M])| - afe 



fc=0 



< 



M 



M 



M 



M [LI2\ 

\LE{Y%M])\-E[Y,Ik\M] + E[Y^l'^\M]-E[Y,h\M] + ^a^- J^a^ 



fc=0 



fc=0 



fc=0 



fc=0 A;=0 



By conditioning on the value of M, we can use (I35p and (I36p to bound the first term, (1311) and 
([36|) to bound the second term, and (j37|) to bound the third term, we get 



pi \LE{Y%M])\ -Y^ak 
^ k=0 



^ CL ^^ ^ CloglogL 



(logL)3/4y - (logL)V4 ■ 



(38) 



For all k, let 



(logA:)V3 

cfc = — - — 2^ flj. 

j=0 



k 



It is easily seen from ([29]) that < inf Cfc < supcfc < oo. The proposition now follows from (|38|l . 
OTD, and mi. □ 
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3.2 Capacity of a loop-erased segment 

Let U and V be subsets of Z^. Let (Xj)^q be a random walk on such that Xq has the uniform 
distribution on Z^. Recall from ([2|) that the capacity of a set [/ C Z^ is defined by 

CaVhAU) = P{Xt G U for some t < M). 

As in |15j . define the closeness of the sets U and V by 

CloseM(f^, V) = P{Xt G [/ and X, e y for some s,t<M). 

In this subsection, we estimate the capacity of a loop-erased random walk segment, and show 
that the capacity is highly concentrated around its mean. The first step is the following bound 
on the closeness between two random walk segments. 

Proposition 3.3. Let X = {Xt)'^^ and Y = (yj)^o independent random walks on Z^, and 
assume that Yq has the uniform distribution on Z^. Assume there is a constant C such that 
K > C'nXogn, L > C'nXogn, M > C'nlogn, and u > t„. Let U = X[0,K], V = Y[0,L], and 
W = X[K + u, K + L + u]. Then there is a constant C such that 

E[CloseMiU,V)] < -2 and E[CloseMiU,W)] < 



n^(logn)2 ' n®(logn)'^ 



Proof. The proof is similar to the proof of Lemma 5.2 in [15j. We may assume that Xq has the 
uniform distribution on Z^. Let Z = (Zt)^Q be a random walk on Z^, independent of X and Y, 
such that Zq has the uniform distribution on Z^. For all sets S, let Ts = m.m{t : Zt S}. We 
have 

E[CloseM{U, V)] < P{Tu < M)P{Zt G V for some Tu<t< M\Tu < M) 

+ P{Tv < M)P{Zt G U for some Ty <t< M\Tv < M), (39) 

and the same inequality holds with W in place of V. By Proposition 12.81 there is a constant C 
such that 

CKM 



P{Tu <M) = E[Ca.VM{U)] < 
P{Tv <M) = E[CavM{y)] < 



log n ' 
CLM 



log n ' 

P{Tw <M) = E[Capj,j{W)] < -5^- 

log n 

The strong Markov property applied at Tjj implies that 

CLM 



P{Zt G V for some Tu < t < M\Tu < M) < E[C&-p j^^{V)] < 



n*^ log n 

Likewise, the strong Markov property applied at Ty gives 

CKM 



P{Zt G U for some Ty < t < M|Ty < M) < ElC&^^iU)] < 



log n 



It follows that the right-hand side of ([39l) is at most 2C^-ftrLM^/[n^(log n)^]. Since u > Tn, the 
bound for £'[CloseAf(C^, W)] now follows from Lemma [231 with f{U, V) = Closej\/([/, V). □ 
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Lemma 3.4. Let Y = (yj)^Q he a random walk on Z^. Let U = {x : Yt = x for some t < L}. 
Assume that L > C'nlogn and M > C'nlogn for some constant C . Then there exists a constant 
C such that 



2, , CL'^M^ 



n^(logn)^ 



Proof. Let X = (Xt)^Q and X' = (X^)^q be two independent random walks on such that 
Xq and X'q have the uniform distribution. Then 

E[CapM{Uf] = E[P{X[0,M] nU ^ $\Uf] 

= E[P{x[o,M] n [/ / ^u)P(x'[o, M]nu^ 0|c/)] 

= E[P{X[0, M]nu ^ 0, X' [0, M]nu ^ 0|C/)] 
= E[P{X[0, M] n [/ / 0, x'[o, M]nu ^ 0|X, X')] 
= E[C\ose l{X[0,M], X' [0, M] )] 

< 72 40 

n*(log n)^ 

by Proposition 13.31 □ 

We now work towards a lower-bound on the probability that a segment of a random walk on 
intersects a segment of a loop-erased random walk on Z^. Lyons, Peres, and Schramm [13] showed 
that for two transient Markov chains with the same transition probabilities, the probability that 
one intersects the loop-erasure of the other is at least times the probability that the paths of 
the two chains intersect. That is, erasing loops for one of the two chains reduces the intersection 
probability only by a constant. Because this result requires the Markov chains to be transient, 
it applies to random walks on Z^ only if the walks are killed at a geometric time. In Proposition 
13.61 below, we adapt this result to obtain a lower bound on the intersection probability for walks 
of fixed length. The proof uses the following lemma pertaining to the geometric distribution. 

Lemma 3.5. Suppose P{X = k) = p{l — p)^~^ for A; = 1, 2, ... . Then for all positive integers 
m, we have E[Xl^x>m}] = (1 ~ p)^{m + 

Proof. We have 

oo oo 

E[Xl^x>m}]=^P(.Xl^x>n.}>k)=mP{X>m)+ ^ P{X > k) 

k=l fc=m+l 

^ / 1 



= m(l-pr+ 2^ (l-p)'=-^ = (l-p)"(m+- 

k=m+l ^ ^ 

as claimed. □ 

Proposition 3.6. Let X = {Xt)'^Q and Y = {Yt)'^Q be independent random walks on Z^. 
Assume that Yq has the uniform distribution on Z^. If L/n^ oo and L/(n^(log n)^/^) as 
n ^ oo, then there is a positive constant C such that 

r<T2 

p{LE{x[o, L]) n y [0, L] ^ 0) > — — . 

log n 
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Proof. Let ^ > 1 be a number, to be chosen more precisely later, such that L/A is an integer 
greater than one. Let S and T be independent random variables, independent of X and Y, having 
a geometric distribution with parameter p = A/L. Therefore, ^(5" > L/A) = P(T > L/A) = 
(1 - A/L)^/^ > 1/4. By Proposition ESI there is a constant Ci such that 

p{x[o, s] n y [0, t] / 0) > p{s > l/a, t > l/a, x[o, l/a] n y[o, l/a] / 0) 

= P{S > L/A)P{T > L/A)P{X[0, L/A] D Y[0, L/A] / 0) 

^ CiL^ 
~ A'^n^ log n 

By Lemma 1.2 of [T3] . 

p{LE{x[o, s]) n y [0, r] / 0) > 2-^p{x[o, s] n y [o, t] / 0) > ^'f — . (4i) 

2,0 A^n^ logn 

The goal now is to choose A so that if LE{X[f),S]) and y[0,T] intersect, then LE{X[{),L\) 
and y[0,L] also intersect outside an event of low probability. We define the following events: 

• Let Bi be the event that LE{X\{), S]) n y[0, T] / and max{ S',T} > L. 

• Let B2 be the event that X[0, 5 - r„] n X[S', L] / and X[0, 5 - t„] n y[0, L] / 0. 

• Let S3 be the event that X[S - r„, S] n y [0, L] / 0. 

Note that if L^(X[0, 5]) n y[0, T] ^ but LS(X[0, L]) n y[0, L] = 0, then Bi U ^2 U S3 must 
occur. The event Bi accounts for the possibility that the segments L£'(X[0, L\) and y[0, L] could 
fail to intersect because the walks are run for a shorter time. If L > maxjS*, T}, then the segments 
coming from longer walks could still fail to intersect if the portion of the path LE{X[{), S]) that 
y [0, T] hits gets erased between times S and L. If this happens then either B2 or S3 must occur, 
with S3 accounting for the possibility that Y hits near the end of the path LS(X[0, S]) and S2 
accounting for the possibility that Y hits earlier in the path but this part of the path has been 
erased by an intersection with a later part of the X path. Thus, to obtain the proposition from 
([4TI1 . we must bound S(Si), ^(Sa), and P(S3). 

To bound S(Si), note that by Proposition 12.81 there is a constant C2 such that on the event 
S > n log n and T > n log n, 

p{x[o, s] n y[o, r] / 0|5, r) < — 



log n 



Therefore, by conditioning on S and T, we get 



S(Si) < P{X[0, S] n y [0, T] / and max{5, T} > L) 
< 2P{X[0, S] n y[0, r] / and S> L) 



2G 



< -i^E[STl{s>L}] + 2P(r < nlogn). 
log n ^ ' 

We know that S and T are independent and E[T] = 1/p = L/A. By Lemma 13.51 
E[Sl{s>L}] = (l - x) i^ + j)^ + l) - 
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Also, P{T < nlogn) < {Anlogn)/L. It follows that 

m)<^^^ + ^^^^. (42) 

To bound P{B2), note that conditional on X[0, 5" — r„], the probability that X[0, S — Tn] n 
Y[0, L] / is Cap^(X[0, 5-r„]), and by Lemma[23]the probability that X[0, S-Tn]nX[S, L] / 
is at most {3/2)Capi{X[0, S — Tn]). Because X[S,L] and F[0,L] are conditionally independent 
given X[0,S — Tn], we have 

P{B2) < -E[CapLiX[0,S-Tn]f] < -E[CapLiX[0,S]f] < (43) 
2 2 74^n°(lognj^ 

where the last inequality follows from Lemma [3.41 bv conditioning on S. Also by Proposition 12.81 

PiBs) < ^p^. (44) 
log n 



By choosing A sufficiently large, we can make the right-hand side of (j42p smaller than one-half 
the right-hand side of (j4ip . The right-hand of (j43p is much smaller than the right-hand side of 
(j4ip because L/(n^(log n)^/^) — > 0, and the right-hand side of (j44p is much smaller than the right 
hand side of (j4ip because L/r^ ^ oo in view of (jlOp . The proposition follows. □ 

We now use Proposition 13.31 to bound the probability that a segment of a random walk inter- 
sects a shorter loop-erased segment. The result does not follow immediately from Proposition 13.61 
because in Proposition 13.61 we require the two segments to be the same length. This assumption 
was required for the proof because, to apply the result of Lyons, Peres, and Schramm }13j . we 
needed to assume that the two killed random walks had the the same distribution, and therefore 
that they were killed at the same rate. However, we can extend the result to cover the case in 
which the loop-erased segment is shorter by splitting the longer random walk segment into pieces. 

Proposition 3.7. Let {Xt)'^Q and {Yt)'^Q be independent random walks on such that Yq has 
the uniform distribution on l^n- Assume that K < L with K/ri^ oo and L/(n2(log n)^/^) 
as n ^ oo. Then there exist constants Ci and C2 such that 

^'^^ < P{LE{X[0, K]) n y [0, L] / 0) < (45) 



log n log n 

Proof. We may assume that Xq also has the uniform distribution on Z^. Because 

p{LE{x[o, K]) n y[o, L] # 0) < P{x[o, K] n y[o, l] # 0), 



the upper bound in the proposition follows immediately from Proposition | 

To prove the lower bound, let M = \_L/2K\. If M = 0, then L < 2K and because 

P{LE{X[0, K]) n Y[0, L] / 0) > P{LE{X[0, K]) n y[0, K] / 0), (46) 

the lower bound follows from Proposition 13.61 Assume now that M > 1. For j = 0, 1, . . . , M — 1, 
we have 

P{LE{X% K]) n Y[2jK, (2j + l)K] / 0) > -^J^ 

log n 
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for some positive constant C3 by Proposition 13.61 The probability that LE(X[0, K]) intersects 
both Y[2jK, {2j + 1)K] and Y[2iK, {2i + 1)K] is at most the probability that X[0, K] intersects 
both of these segments, which is E[CloseK{Y[2jK, (2j + l)K],Y[2eK, (2£ + 1)K])]. If j / £, the 
intervals [2jK, (2j + 1)K] and [2iK, {2i + 1)-?^] are separated by a gap of at least K > Tn, so by 
Proposition 13.31 

E[CloseKiY[2jK, (2j + l)K],Y[2£K, {21 + l)K])] < ^^^^ 



n'^(log n)^ 



for some constant C4. There are M of these segments and M{M — l)/2 pairs of segments, so by 
inclusion-exclusion. 



n^logn 2n'^(logn)2 n^logn 

for some constant Ci, as claimed. □ 

Proposition 3.8. Let X = (Xf)^Q be a random walk on Z^. Assume that there are con- 
stants Ci and C2 such that Cin^{lognf/^^ < L < C2n^ilogn)^/^^ and Cin'^{lognf/'^^ < M < 
C2n'^(logn)^/^^. Then, for all 6 > 0, there is a constant C and a sequence of constants (an)^i 
with < inf On < supon < 00, possibly depending on the values of L and M, such that 



P 



C^Vm{LE{X[{),L])) 



(logn)2/ii 



(log n)^/22y (logn)^/^^" 



Proof. Let kj = j\n^{\ogn)^^'^'^^, and define v so that = k^ < ki < ■ ■ ■ < ky < L < k-u^i. 
Note that v < L/(n2(logn)''/22) < C2(log n)5/22. Let U = LS(X[0,L]), and for i = 1, . . . , let 
Vi = Li?(X[A;j_i, ki\) and Wi = X[ki^i, ki\. Let D be the set of points in U or in U^^-^Vi but not 
both. We first claim that 

V V i—1 V 

CapMiVi) - C^VkAD) GlosBMiVi, V,) < CapM(f/) < Yl CapM(^i) + CapM{D). 

1=1 i=l j=l i=l 

(47) 

To see this, let Y = (1^)^q be a random walk on started from the uniform distribution. If the 
path y[0, M] intersects U, then it must either intersect D or one of the Vi, which gives the upper 
bound for Capj^^(?7). For the lower bound, first note that, conditional on X[0, L], the probability 
that y[0, M] intersects U but not U^^iVi is at most the probability that it intersects D. By 
inclusion-exclusion, the probability that it intersects UjLj^Vi is at most the sum of the probabilities 
that it intersects the individual Vi minus the sum over pairs i and j of the probabilities that it 
intersects both Vi and Vj. 

Let Bi be the event that there exist s and t with s + 2r„ < t < L such that Xg = Xt- 
That is, Bi is the event that the path X[0, L] has a long loop of length at least 2t„. Fix ^ > 0, 
and let w = [n2(log n)^J . Let B2 be the event that the path X[0,L] has an interval [t,t + w] 
with < t < -L — n2(logn)^ that does not contain a local cutpoint. By Proposition 12.11] 
P{Bi) < C(logn)-'^/22, and by Proposition [2T3l ^(^2) < C(logn)~i. 

Let B = {Bi U B2y. On the event B, for each i = 1,2,... ,v, the path X[0,L] has local 
cutpoints Ui and Vi such that Ui G [fej_i,/ci_i + w] and Vi £ [ki — w,ki]. On the event B, if 
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ti < Ui < t2 < Vi < is, then X^^, Xt2, and Xt^ must all be distinct, as the definition of a local 
cutpoint prohibits short loops while long loops do not occur on It follows that if Uj < t < fj, 
then Xf is a point in the path LE{X[ki-i,ki — 1]) if and only if Xt is a point in the path 
LE{X[0,L]). That is, we have Xt £ U if and only if Xt G Vi. It follows that D C D', where 
D' = X[ki^i, + u;] U X[ki — w, ki] U X[ky,L]. By Proposition 12.81 the expected capacity of a 
random walk segment of length it? + 1 is at most CMw/{n'^\ogn). Because the capacity of D' is 
at most the sum of the capacities of 2v segments of length w + 1 plus the capacity of one segment 
of length at most n^(log n)^/^^, it follows that 

CMw \ C7Mn2(logn)4/22 



\ log n J 



log n 

< C(logn)^/22+9/22+e-i ^ ^(iQg^)9/22+4/22-i < c {logn)~^/^^+' . (48) 

We next bound CloseM(Vi, V^). We have CloseM (VijVj) < Close M(Wi,Wj), as erasing loops 
can only reduce the probability of intersection. Therefore, by Proposition 13.31 if j < i — 1 there 
is a constant C such that 

Finn.. (V v)^< CM\h-h.,){k,-k,^,) 4Cn«(iogn)^«/22+s/22 4rno..rW22 

E[CloseM(F.,VS)]< ^s^^^^^^p < ^g^j^^^^p <4C(logn) 

To bound the closeness between Vi and Vi_i, note that for the random walk y[0, M] to in- 
tersect both Vi and Fj-i, it must either intersect X[ki-i — which has probability at 
most CMw/{n^\ogn) < C(logn)~^^^'^'^~^^ by Proposition 12. 8^ or else it must intersect both 
X[ki^2,ki-i — and X[ki^i,ki — 1], which has probability at most C(log n)"-'^^/^^ by Proposi- 
tionEa Therefore, E[CloseM{Vi^i,Vi)] < C7(log n)~i3/22+e_ follows that 



V i—1 

E ]^ 

" 3 



X:ECloseM(V^.,^,)] < C^;(logn)-i=^/22+^ +c(^)(logn)-i«/22 < C(log (49) 

i=l j=l -I ^ ^ 

By combining ^ij, (08]), and (09]), we get 



E 



Capj\,j(C/) - ^ CapAf(V'i) 



LB 



<C(logn)-8/22+e^ 



Now Markov's Inequality gives 



p(^|capM(C/) -ECaPM(^.)| > ^(^^^^) < C(logn)-3/22+.^p(^c) < ^(log n)-3/22+^. 

(50) 

By Lemma 13.41 we have 

Var(CapA,(F.)) < i^[(Capjv,(V^))'] < E[iCa.pMiWi)f] < C(log n)-i8/22_ 
Because the random variables Capjy,j(Vi), . . . , Cap^,/(Vu) are independent, it follows that 



Var('^CapM(T^i)) < C(logn)- 



13/22 
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Therefore, by Chebyshev's Inequality, 



P 



2 = 1 



i=l 



> 



2(logn)5/22 



< C(logn) 



-^3/22 



(51) 



By Proposition 13.71 there exist constants Ci and C2 such that for all i = 1,. . . ,v, we have 
Ci(log < E[Capj^,j{Vi)] < C2(log n)~^/^^, and therefore there exist constants C3 and C4 
such that 



C3(logn)-2/ii <ii; 



1=1 



< C4(logn 



1-2/11 



Therefore, by setting a„ = (logn)2/ii^[^^^^ Cap^l^i)], we get the result from ([50|) and ([5T]) . □ 



3.3 Decomposing and labeling a loop-erased path 

Let X = (Xf)^Q be a random walk on Z^. Let T be a fixed or random time. In this subsection, 
we will consider the loop-erased path LE{X[0, T]). We will study the structure of the loop-erased 
walk by breaking the original path into segments of length r = [n^(log n)^/'^^J , and keeping track 
of which of these segments are retained after loop-erasure. This will lay the groundwork for the 
coupling in the next section, which will require a similar decomposition of the loop-erased paths 
used in Wilson's algorithm. 

To define the time indices that are retained after loop-erasure, for any positive integers u < v 
let a^'^ = max{t : u < t < v and Xf = Xu}- For i > 1, whenever cr^^i < v, let 

a^'^ = max{t : a^^"^ < t < v and Xf = 

Let W{u,v) denote the set of all times cr^'^ , so the path LE{X[u,v]) is the same as the path 
{Xt)t£W{u,v)- Note that if vi < V2, then {W{u,V2) H [ti,^!]) C W{u,vi) because new time indices 
can get erased but not added when the random walk is extended from time vi to time V2- We 
will consider segments of the random walk of length r. Define the jth such segment to be 
Aj = {{j - l)r, ...,jr- 1}, and let £ = [T/rJ + 1. Let 9 > 0, and then set w = [n2(logn)^J . Let 
A'j = {{j — l)r + w, . . . , jr — w — 1}, which is Aj with the first and last w points removed. We 
assume that n is large enough that w > 2Tn- Denote {Xt)t^Aj by 

We now define the events Bi, . . . , B^. When none of them occurs, we say the walk X[0, T] is 
"good." This condition is close to the condition introduced by Peres and Revelle [l5j for a path to 
be "locally decomposable." When X[0, T] is good, it will be possible to approximate LE{X[0, T]) 
using the loop-erasures of the individual segments of length r. 

• Let Bi be the event that X[0, T] has an interval [t, t + w] with < t < T — w that does not 
contain a local cutpoint. 

• Let B2 be the event that for some j = 1, . . . ,i — 1, there exist two times s,t € Aj U Aj^i 
such that t > s + 2r„ and Xg = Xf. 

• Let B3 be the event that there exist distinct integers i,j,k < £ such that \i — j\ > 2, 
\i — k\ > 2, and X[j4j] intersects both and ^[Afc]. 
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• Let i?4 be the event that there exist i < i and j < i with |i — j| > 2 such that Xg = Xt for 
some s £ Ai and t £ Aj\ A'- . 

• Let i?5 be the event that for some j < i — 2, we have -'^[^j] H X[j4^] 7^ 0. 

On Bf, the walk contains local cutpoints in every segment of length w. On B2, there are no 
long loops within segments of length 2r. The event prohibits one segment of length r from 
intersecting two other segments, while -B| prevents intersections involving the first or last w steps 
of one of the segments. On B^, the last segment does not intersect the others. Later, we will 
prove Proposition 14. 1^ which will establish that if T has a geometric distribution with a mean of 
order n^(logn)^/^, then the probability that X[0, T] is good is at least 1 — C(logn)~^/^^. 

To keep track of intersections between these segments of length r, let Ijj+i = for j = 
1,2, ...,£ — 1. For integers i and j with l<i<i, l<j<i, and j > i + 2, define lij to be 
the indicator of the event that LE{X[Ai]) D X[Aj] / 0. Following let 5o = {0} and, for 
j = 1, . . . ,i — 1, define 

Sj = {ke Sj-i : lij = for allie{l,...,k}n Sj-i} U {j}. 

Roughly speaking, Sj consists of the segments that have survived loop-erasure up to time jr. Let 
S = Si-i- Let Jj = {k < j : Ii^k = 1 or j = 1 for some i < j} consist of the indices of the 
segments that are involved in an intersection by time jr, and let J = Je-i U {I}. 

The next lemma will enable us to approximate the loop-erasure of the whole walk X[0, T] by 
the loop-erasure of the individual segments when the walk is good. The lemma states that for k S 
SnJ'^, the times retained in LE{X[Ak]) are exactly the times in Aj. retained in LE{X[0, jr — 1]), 
except possibly for the first and last w steps, while no times are retained for A; G 5"^ n J^. 

Lemma 3.9. Suppose X[0,T] is good. Then 

W{0, T) n A!^ = W{{k - l)r, fcr - 1) n A'^ 

for allk£ Sn and W{0, T) n = for all k £ ST\ 

Proof. We claim that for all j = 1, 1, we have 

W{0,jr - 1) n A'^. = W{{k - l)r, kr - 1) n A'^. (52) 

for k £ SjH J|, and W{0, jr — l)n A/^ = for all k S SjCiJ^. To see how the lemma follows from 
the claim, note that on , we have W{0, {£ - l)r - 1) n A'^. = W{0, T) n A'f^ for all A; < £ - 2. 
Furthermore, on B2, we have W{Q, {l — l)r — l)r\A'^_^ = W{Q, r)n These results, combined 

with (j52p when j = d. — 1, imply the theorem. 

It remains to prove the claim by induction on j. The claim is trivial when j = 1. Suppose 
the claim holds for j — 1, where j — 1 < i — 2. Suppose k £ Sj f] J?. There are three cases: 

• Suppose k < j — 2. Then k £ Sj-i n Jj^i, so by the induction hypothesis it suffices to show 
that W{0, {j — l)r — l)r\A'f, = W{0,jr — l)r\A'j^. We proceed by contradiction. Suppose this 
equality fails. Then Xf = Xs for some t £ Aj and s £ W{0, {j — l)r — 1) Ci [0, kr — w — 1]. 
On we must also have s £ A'- for some i = l,...,k, and on B^, we can not have 
i £ Jj-i. By the induction hypothesis W{0, {j — l)r — 1) n A'j = when i £ Sj_i Pi Jj^i, so 
i £ Sj-i n Jj-i- Using the induction hypothesis again, we get that W{0, (j — l)r — l)r\A'- = 
W{{i — l)r,ir — 1) n A[, which implies that Xg £ LE{X[Ai]). However, this means that 
lij = 1, which contradicts that k £ Sj. 
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• Suppose k = j-l. Again we must show W{0, {j - l)r - 1) n A'^^ = W{0, jr - 1) n A'j^. If the 
equahty fails, then Xt = Xg for some t G Aj and s E W{0, (j — l)r — 1) n [0, (j — l)r — w — 1]. 
On 51, we must either have s £ A'^ for some i = 1, . . . , j — 2, which leads to a contradiction 
as in the previous case, or (j — 2)r < s < {j — l)r — w — 1, which can not happen on i?2- 

• Suppose k = j. On Bf, between times {j — l)r and (j — l)r + lu, the walk {Xt)f^Q has a 
local cutpoint, which we call u. We claim that 

WiOJr - 1) n {n, . . . , jr - 1} = W{ij - l)r,jr - 1) n {u, . . . , jr - 1}, (53) 

which will imply (j52p . We prove (j53p by contradiction. Suppose ()53p fails. Then there exist 
s and t such that Xt = Xs, where u < t < jr — 1 and s G H^(0, u — 1)L) W{{j — l)r, n — 1). 
However, since n is a local cutpoint, this can only happen if t > s + 2r„, and then on B2, it 
must be that s < (j — 2)r — 1. However, on S|, we must have s £ A'- for some i = 1, . . . , j — 2. 
As in the first case, this would imply that lij = 1, contradicting that k £ = J?. 

Next, suppose k G Sj Ci Jj. Then I^ i = 1 for some h and i such that h < k < i < j and 
/i G Sj-i. This means that Xt G Li?(X[>l/i]) for some t £ Ai. Choose t = min{n G Ai : Xu £ 
LE{X[Ah])]. On Bl, it follows that Xt = X^ for some s £ W{{h - l)r, hr - 1) n A'f^. On B^ we 
must have h £ Jf_i, so by the induction hypothesis, s £ W{0, {i — l)r — 1) n A'^. It follows that 
1^(0, t) n = and since jr - 1 > t, we have W{0,jr - 1) n {s,t) = 0. Also, Ak C so 
M^(0, jr — 1) n ylyfc = 0. This completes the proof of the claim. □ 

For i = let Nj be the cardinality of the set W{0,T) n Aj, which is the number of 

points from the jth. segment retained after loop erasure. The next proposition bounds the random 
variables Nj. 

Proposition 3.10. Suppose X[0,T] is good. Then the following hold: 

1. Ifj£Sn J^, then \LE{X[Aj])\ -2w< Nj < \LE{X[Aj])\ + 2w. 

2. If j G 5^ n then Nj = 0. 

3. If j £ J, then < Nj < \LE{X[Aj])\ + 2w. 

Proof. Parts 1 and 2 of the proposition follow immediately from Lemma l3.9i It remains to prove 
part 3. Suppose j £ J. On i^g, there can be at most one value of k such that Ij or is 
nonzero, so there are three cases: 

• Suppose j = L On the random walk X[0, T] has a local cutpoint at some time between 
[i — l)r and {I — l)r + w, which we call u. Then 

W(0, T) n {n, . . . , T} = W{{t - l)r, T) n {n, . . . , T} (54) 

unless there exist two times s and t such that Xt = Xg, where u < t < T and s £ 
W{0,u — 1) U W{{i — l)r,u — 1). However, since u is a local cutpoint, this would imply 
t > s + 2Tn, which is impossible on B2 H B^. Therefore, (I54p holds. It follows that 
Ni< \LE{X[{£-l)r,T])\+w. Now if |LS(A:[(^ - l)r, r])| > |LS(X[yl^])| + 2r„, then there 
exist s < T — 2r„ and t > T such that Xg = Xt, which is impossible on It follows that 
Ne < \LE{X[Ae])\ + w + 2t„ < \LE{X[Ae])\ + 2w. 
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• Suppose Ikj = 1 for some k < j < i. Let 

t = mm{u > {j - l)r : ^ Xs for all v £ [u, jr - 1] and s € [0, kr -l]n W{0, u - 1)} 

be the first time within the jth segment after which there are no more intersections involving 
earlier segments. Then 1^(0, jr — l)n^j = W{t — 1, jr — l). On i?^^, the random walk X[0, T] 
has a local cutpoint u in [t—1, t—l+w]. On i?2, we have W {{j — l)r, jr—l)r]{u, . . . , jr—l} = 
W{t - l,jr -l)n{u,...,jr- 1}. Since W{0,T) n Aj C W{0,jr - 1) n Aj, it follows that 
N,<\LE{{Xt)teA,)\+w. 

• Suppose Ij^k = 1 for some j < k < i. On B^, we must have j £ Jk-v claim in 
the proof of Lemma EJl either Ty(0, {k - l)r - I) r^ Aj = % or 1^(0, {k - l)r - 1) n A'- = 

- l)r, jr - 1) n A'.. It follows that W{<d,T) n A'- C W{{j - l)r,jr - 1) n A'-, and 
therefore Nj < \LE{{Xt)teA,)\ + 2w. 

Because these three cases cover all j E J, the proof is complete. □ 

Remark 3.11. To prepare for the construction of the spanning tree in the next section, note that 
we can create a path whose vertex set is a subset of {1, ■ ■ ■ ,1} in two ways. First, let Vo = {0}. 
Given the path Vj-i for some j G {!,...,£}, define Vj by the following rules: 

• If there is a in the path Vj-i such that I^j = 1 (note that there can be at most one such 
k on i^g), then remove the vertices k + 1, . . . ,j — 1 from the path, and give the vertex k the 
new label j. 

• Otherwise, add the vertex j to the graph with an edge between j — 1 and j. 

Let V = Ve- Then, by the construction, the vertices in V are precisely the points in Se. If 
h <^2 < ■■■ < ik are the vertices, ranked in decreasing order, then there is an edge from to 
ij for j = 1, . . . , A: — 1. 

Alternatively, consider LE{X[0, T]), and give the points in {Xt)t£W(o,T)nAj the label j. Let V 
be the path whose vertex set consists of all j such that Nj > 0, and such that i and j are connected 
by an edge if and only if, when we write the points in W{0,T) as a^' < < • • • < cr^' , we 
have cr^' E At and cr^l^.^ G Aj for some k. That is, i and j are connected by an edge if there are 
adjacent points in LE{X[0,T]) with one labeled i and one labeled j. By Proposition 13. 10^ the 
vertex j is in V' j £ S Ci J'^ but not if j G S'^ Ci J'^. Therefore, the only difference between the 
paths V and V is that some j £ J could be included in one path but not the other. 

4 Coupling with the complete graph 

In this section, we obtain the scaling limit of the uniform spanning tree on by using Wilson's 
algorithm and coupling the construction of the uniform spanning tree on with the construction 
of the uniform spanning tree on the complete graph. This is essentially the same as the approach 
in [l^, although we describe the coupling slightly differently by coupling two labeled trees. 
Recall that 

r = [n2(logn)9/22j^ 
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which is the length of the random walk segments on that we will consider. Also, let < < 
1/22 be a small positive number, and let 

w = [n^(logn)^J. 

The significance of w is that a random walk segment of length at least w has a local cutpoint 
with high probability (see Proposition 12.13]) . When necessary, we will assume n is large enough 
that w > 2r„. 

The constructions of the spanning trees on the complete graph and on will require intro- 
ducing a considerable amount of notation. For the convenience of the reader, we summarize some 
of this notation in the table below. More precise definitions of these quantities will appear when 
they are introduced. 



Km,a Complete graph on m vertices with root added 

Gn^fS Graph with root added 

(T, d) Uniform spanning tree on complete graph K^n 

(T* , d* ) Labeled spanning tree on Km,a 

(T, d) Uniform spanning tree on Z^ 

(Tfc, d') Tree on Z^ built up after k walks in Wilson's algorithm 

(T*,d*) Labeled spanning tree on Gn,pi after collapsing segments 

Ri The ith segment of the tree considered in the algorithm 

R!j^ The interior of the ith segment considered in the algorithm 

Nj The vertices of the spanning tree on the torus labeled j 

Sj Vertices in the tree on Km,a after j steps of Wilson's algorithm 

Sj Segments in the tree on Gn^p after j steps of Wilson's algorithm 



4.1 A partial spanning tree on the complete graph 

To make the coupling work, we will choose the number of vertices in the complete graph to be 

m=[a-\lognf/'^\, (55) 

where a„ is the constant from Proposition 13.81 when we use r — 2w in place of M and r in place 
of L. Recall from the introduction that Km is the complete graph on Tfi vertices and Krn a is 
obtained from by adding a root vertex p that is connected to every other vertex by an edge 
of weight m/{a^/rn — 1). 

We now define a portion of a spanning tree on K^^Q^ using Wilson's algorithm. Choose 
vertices yi, . . . ,yk uniformly at random from Km- Define trees Tq, 7i, . . . , 7^ by running Wilson's 
algorithm on Km,a starting from the points p,yi, . . . ,yk- More precisely, for i = l,...,k, let 

= (y/)^Q be a weighted random walk on Km.,a such that Yq = y^. Let Tq consist of the single 
vertex p. For i = 1, . . . , fc, let Vi = min{t : 1^* is a vertex of Ti-i} and define 71 by adjoining 
to Ti_i the path LE{Y'''[0,Vi]). Let ki = 1 and for 1 < i < A;, let Kj+i = + + 1. Let 
/ = Kfc+i — 1. If 1 < i < i' , let f{j) be the value of i such that < j < Kj+i, so the jth vertex 
visited in Wilson's algorithm is part of the walk Y^^^h Then let Vi = y/^-'^ which is the ith 
vertex visited during the construction. 

For integers i and j with 1 < i < j < let Ijj be the indicator of the event that Vi = Vj ^ p. 
For 1 < j < i' , let Iqj be the indicator of the event that vj = p. We now obtain a tree T*, which 
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will be the same as the tree Tfc but with the vertices labeled in the order that they are visited 
during the construction. To obtain this tree, we follow Wilson's algorithm one step at a time. 
At each step, we keep track of the set Sj, which will be the nonzero vertex labels after j steps in 
the construction. 

Construction: Given ki < K2 < ■ ■ ■ < Kfe+i and given indicator random variables lij for 

< i < j < i', where i' = Hk+i — !> we define a labeled tree T* as follows. Begin with just the 
root vertex, labeled 0. Define Sq = 0. For j = 1,2, .. . ,£', let /(j), as above, be the value of i 
such that Ki < j < Ki, and proceed inductively as follows. 

• If lij = for all i G Sj-i, then add the vertex j to the graph, and draw an edge from j — 1 
to j unless j = kh for some h. Also, let Sj = Sj-iVJ {j}. 

• Otherwise, let i = min{/i G Sj-i : I^j = !}• There are three cases: 

1. If i = 0, then draw an edge from j — 1 to and let Sj = Sj-i. 

2. If f(i) = f{j), then a loop has formed, so we change the label of the vertex labeled i 
to j and erase vertices i + 1, . . . , j — 1. Let Sj = {Sj-i H {1, . . . ,i — 1}) U {j}. 

3. If f{i) < f{j), then the walk has hit the previous tree, so we draw an edge from j — 1 
to i unless j = k^ for some h. Let Sj = Sj-i. 

The tree T* is the tree with vertex set S^/ that has been obtained after £ steps of this algorithm. 

Because T* is obtained by Wilson's algorithm, it is the same tree as Tj. with the vertices 
relabeled. Note that the vertex yi initially has label Ki, although the label could change if this 
vertex is visited another time. For each i and j, there is a unique path in T* from yi to yj. Define 
d*{yi,yj) to be the number of vertices along this path, unless the path goes through p (i.e. the 
vertex labeled 0), in which case let d*{yi,yj) = 00. 

4.2 A partial spanning tree on the torus 

We now define a portion of a spanning tree on Gn,f3, which is the graph obtained by adding to 
a root vertex p, connected to all other vertices by an edge of weight 8/ (/3n^(log n)^/^ — 1). Choose 
points Xi,. . . ,Xk uniformly at random from Z^. Define trees Tq, 7i, . . . , 7^ by running Wilson's 
algorithm on Gn^p, starting from the points p,xi, . . . ,Xk. More precisely, for i = l,...,k, let 

= (Xj)^o be a weighted random walk on Gn,i3 such that Xq = Xj. We will also adopt the 
convention that p is an absorbing state, so if X| = p then we set X* = p for all s > t. Let % 
consist of the single vertex p. For 1 < i < k, let Ui = m.m{t : XI is a vertex of Then 
define % by adjoining to 7]_i the path Vi = LE{X^[Q, Ui]). 

To break up the branches of the tree into segments of length r, let (^1 = 1 and, for \ < i <k, 
let Ci+i = Ci + L^i/'^J + 1- ^ — Cfc+i ~ 1- If 1 < J < let g{j) be the value of i such that 
Ci ^ j < Ci+i- This means that the jth segment of length r is part of the walk X^^^\ If g{j) = i, 
define = {{j -Ci)r, . . . ,{j -Ci + l)r-l}. Also, let = {{j -Ci)r + w, . . . ,{j -Ci + l)r-w-l}, 
which is obtained by removing the first w and the last w points from Aj. To simplify notation, 
for 1 < j < we will denote the path X^^^^^lAj] by Rj, and we denote the path X^^^^^lA'j] by R'^. 
Also, denote X^^^^ [Aj \ A'j] by R~ . We call the paths Rj segments, and we call two segments 
and Rj^ adjacent if g{ji) = g{j2) and \ji - jal = 1- 
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We now define events Di, . . . , D^. When each of the walks ^*[0, Ui\ is good and none of the 
events Di,. . . ,0/^ occurs, it will be possible to analyze the spanning tree by breaking up the 
random walks into segments of length r. 

• Let Di be the event that some segment intersects two segments not adjacent to it at points 
other than p. 

• Let D2 be the event that there are non-adjacent segments Rj^^ and Rj^ such that Rj-^ and 
i?~ intersect at some point other than p. 

• Let D3 be the event that some segment intersects p and also intersects a non-adjacent 
segment at a point other than p. 

• Let 1)4 be the event that for some i, Rq intersects either p or a non-adjacent segment. 

Proposition 4.1. Let G he the event that X*[0, C/j] is good for i = 1, . . . , A; and that none of the 
events Di, D2, D3, or occurs. Then there is a constant C , which depends on k, such that 
P{G) > 1 - C(logn)-i/ii. 

Proof. Because the stopping times Ui depend on the paths, we will work also with the times Vi, 
where Vi = min{t : XI = p}. Then the stopping times Vi, . . . ,Vk are independent of the walks 
X^, . . . , X^. Each Vi has a geometric distribution with mean /3n^(logn)^''^, so there is a constant 
C such that £^[1^"'] < Cn^-^ (log n)-'/-^ for i = 1, . . . ,k and j = 1, 2, 3. Also, since p is in the tree 
To, we have Ui <Vi for all i. 

Because we will work with the potentially longer walks X*[0, 1^] in the proof, it is necessary 
to relabel the segments. Let Ci = 1 ^-^d, for 1 < i < fc, let O+i = Ci + L^/'^J + 1- Let 
£ = Ck+i — 1- If 1 < J < ^5 let g{j) be the value of j such that Ci < i < Ci+i- If 9ij) = define 

= {{j - Q)r, -Q + l)r - 1} and A'j = {{j - Ci)r + vu, . . . , {j - Q + l)r - w - 1}. For 

1 < J < ^, define Rj, R'p and i?J in the same way as Rj, R'j, and RJ but using g, Q, Aj, and A'- 
in place of g, Q, Aj, and A'j respectively. Finally, for i = 1,2,3,4, let Di be defined in the same 
way as Di. Because Ui <Vi for all i, we have Di C Di for i = 1,2, 3, 4, so to bound P{Di) it will 
suffice to bound P{Di). 

Recall that five conditions must be satisfied for the walk X*[0, to be good. Define the 
events B\, . . . , B\ in the same way as the events Bi, . . . , B5 were defined in the previous section, 
but using X*[0, Ui] in place of X[0,T]. We first consider By Proposition 12.131 for any j £ N, 
the probability that a random walk {Xt)'^Q on contains an interval of the form [t, t + w] with 
(j — l)n^(log n)^/^ < i < jn^(logn)^/^ that has no local cutpoint is at most C/(logn). Since 
Ui <Vi and Vi is independent of X^, it follows that 



piBi\v)< ^ 



logn 



Vi 



(logn) 



+ 1 



By taking expectations of both sides, we get P{Bl) < C/(log n) for i = 1, . . . , /c. 

We next consider the event i?2- If Ci ^ J < Ci+i; then the walk X^[Aj U ^j+i] has length 2r. 
If {Xt)^LQ is a random walk on Z^, then by Proposition 12.111 the probability that there are two 
times s and t such that |s — t| > 2r„ and Xg = Xt is at most C(2r)^/(n^ logn). It follows that 



P{B\\V) < 



4Cr^ ACVr 
< 



log n n^ log n 
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Taking expectations of both sides gives P{B\) < C{logn)^^/^^ . 

To show that each of the walks X^[0, Ui] is good with high probabihty, it remains to bound 
P{B^), P{Bl), and P{Bl). However, if S| occurs for some i, then Di occurs, and if B\ occurs 
for some i, then D2 occurs. Also, the last segment of X*[0,?7j] must either hit p or one of the 
segments X^[0,Uj] for j < i. Therefore, on n D^, the event Bi^ can not occur. Thus, to 
complete the proof, it suffices to show that P{Di) < C(logn)~^/^^ for i = 1,2,3,4. 

We first bound P{D2). Let Wj-^j^ be the event that Rj-^ and Rj^ intersect at some point other 
than p, meaning that Rj^ intersects either the first w or the last w steps of Rj2 ■ The segments Rj-^^ 
and Rj^ are either parts of different walks or are separated in time by at least r„, so Lemma 12.51 
and Proposition 12.81 imply that PiWj-^j^) ^ C rw /{n'^ log n). There are at most P pairs (ji, ^'2) 
to be considered, so 

p{D2\v„...,v,)<^:^. 

log n 

By taking expectations of both sides, and using that E[P] < Cn^r^^ log n for some constant C, 
we see that there is a constant C depending on k such that P{D2) < C(log n)~^/^^^^. 

To bound P(Di), suppose ji , j2 ) is < ^ are distinct integers and neither Rj^ nor Rj^ is adjacent 
to Rj^. On the event D2, the walk Rj-^ can only intersect Rj^ and Rj.^ if the walk R'^^ intersects 
both R'j^ and R'j,^, an event that we call Wj-^j^j^- Let Ai, A2, and A3 be any paths in of length 
r — 2w, and let Si, S2, and ^3 be paths in that come from independent random walks of 
length r — 2w, started from the uniform distribution. Since the segments R'j_^, R'^^, and R'j^ are 
separated from one another by at least 2w > Tn, applying Lemma 12.51 twice gives 

P{R'j^ = Ai,i?;.^ = A2,i?;3 = A3) < ^P{Si = Ai, ^2 = A2, 53 = A3). (56) 

Therefore, the probability that R'j^ intersects both R'j^ and R'j,^ is at most 9/4 times the probability 
that Si intersects 5*2 and 5*3. Thus, by Proposition 13.31 



9 Cr'^ 
PiWn,n,s) < 4i?[Close._2.(52, S3)] < ^^g^j^- (57) 



Since there are at most P possible choices of {ji,j2,j3)i we get 

Cr^P 



P{Di\Vi, . . . ,Vk) < 



n^(log n] 



2 • 



Since E[P] < Cn^r ^(logn)^/^ for some constant C, taking expectations of both sides gives 
P{Di) <_C(logn)-Vii. 

For 1)3, note that if Rj^ intersects p, then ji = Qj^i — 1 for some i = 1, . . . , fc. If Rj^ and Rj^ 
are not adjacent segments, then by Proposition 12.81 the probability that Rj-^ and Rj^ intersect at 
some point other than p is at most Cr^ jin^ log n). Conditional on Fi, . . . , Vfc, there are k possible 
choices for j'l and at most I for j2 , so 

P[p^\Vi,...,Vk) < 



log n 

Since E[£] < Cn^r~^(log n)^/^, we take expectations of both sides to get PiDs) < C{logn)^^/^^ . 
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Finally, to bound P{D/i), suppose ji = Because the time that the random walk first 
visits p has a geometric distribution with mean /3n^(log n)^/^, the probability that Rj-^ intersects 
p is at most r/(/3n^(log n)^/^) < C{\ogn)~^^^^ . If j2 ^ {ii,ii + 1}, then the probability that the 
walks Rj^ and Rj^ intersect is at most Cr^/(n^ log n) by Proposition 12.81 Therefore, following 
the same reasoning used to bound P{D^), we get P{D4^) < C(logn)~^/^^. □ 

For the rest of this subsection, we will work on the event G. The next step is to label the 
vertices of the tree 7^ by considering the segments of length r in the order that they are visited 
by Wilson's algorithm. To define the time indices that are retained after loop-erasure of the walk 
X*[0, Ui], let (Tq'^'* = max{t : u < t < v and Xf = Xu}- For j < 1, if crj_^{* < v, then let 

(T^'^'* = maxjt : a^'^^^ < t < v and Xf = X u.v.i , A. 

Let VF*(n, v) denote the set of all times cj'^'*, which are the times retained in LE{X^[u, v]). Give 
the root the label 0. For 1 < i < k and Ci ^ j < Ci+i) give the label j to all vertices of the tree Tj. 
that equal X'j: for some t in VF*(0, Ui) Ci Aj, other than the vertex X^,. Informally, the label j is 
given to all vertices in 7^ that are in the jth. segment of length r visited by Wilson's algorithm. 
For j = 1, let Nj be the number of vertices labeled j. 

For integers i and j such that 1 < i < j < i and Ri and Rj are not adjacent, define lij 
to be the indicator of the event that LE[Ri) n R'^ ^ 0. Note that we use R'^ for the definition 
rather than Rj. This will be convenient for the proof of Proposition 14.71 later, but on D2 the 
two definitions are equivalent. Also, for 1 < j < i, let Iqj be the indicator of the event that Rj 
intersects p, and if j > 2 and g{j) = g{j — 1), let = 0. Note that on D\ n D'^, for 1 < j < i, 

at most one of the random variables Iqj,Iij, . . . , Ij-ij can be 1. 

We can use these indicator random variables to construct a tree T*. Define T* to be the 
tree obtained from the construction in subsection 4.1, with Ci, . . . , (k+i in place of ki, . . . , k^+i 
and the lij in place of lij. Also, denote by Sq, . . . ,S£ the sets that were called Sq, . . . ,S£' in 
subsection 4.1, and let S = Se- We can view T* as being the tree obtained by collapsing each 
segment of length r in 7^ down to a single vertex. The vertices of T* correspond to segments of 
length r that are retained after loop erasure, with the convention that when segment j intersects 
segment i with i < j, segment j is retained while segment i is not. Also, Sj consists of segments 
that have not been erased after j segments have been processed by the algorithm. Finally, let 
Jj = {i 1^ j '■ h,h = 1 or I/j^j = 1 for some < h < j}, and let J = Jg. Thus, J consists of the 
segments that are involved in intersections. 

The next lemma shows that in the construction of T* , for i = 1, . . . ,k, the vertex — 1 
intersects the tree that has been built up after stage d — 1, which implies that indeed the final 
graph T* is a tree. 

Lemma 4.2. On G, we have O+i — 1 = min{j > Qi : I^j = 1 for some h G Sq-i U {0}} for 
i = 1, . . . , k. 

Proof. We proceed by induction. First consider i = 1. We have Ui = min{t : X^ = p}. Also, 
note that Ci = 1 and Sq = 0, and m.m{j : Iqj = 1} = 1 + [Ui/r\. Since, by definition, 
C2 — 1 = 1 + L^i/'"] ) the result for z = 1 follows. 

Now, consider i > 1. The tree Ti-i is simply a union of the paths Vi, . . . ,Vi-i, all of which 
come from loop-erasing good walks. By Lemma 13.9^ for h < i — 1, the only differences between 
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Vh and the corresponding loop-erased segments in STl [Qh-, Ch+i — 1] are the segments in J that are 
involved in intersections, and the first and last w steps of the segments. That is, for /i = 1, . . . , i— 1 
we have 




Note that by the strong induction hypothesis, Ch+i — 1 G Jc^^^^-i for h = 1, ... ,i — 1, which 
is important because the last segment of the path is always included in J in the definition 
used in Lemma 13.91 Because indices from previous paths are never removed from S", we have 
5 n [1, Ci - 1] = S"^,-!- It follows that 




Recall that Ui = min{t : is a vertex of Ti-i}. On we can not have X^. G for any 
j < Ci- If j G ^fi-i, then either Rj intersects some non- adjacent segment Rh for some h < Q — l, 
or Rj intersects p. Therefore on n D^^, if j G Jq-i, the point X^, can not be on the path 
R'j. It follows from ([58]) that either X^. = p, in which case /o,Ci+i-i = 1, or Xlj^ is in LE{Rj) 
for some j G 'S'^^-i, which on D2 means Ij^q^^^i = 1. Therefore, Q^i — 1 > min{j > Q : I^j = 
1 for some h G Sq-i U {0}}. 

Now, suppose j > Ci, h £ Sq-i U {0}, and Ihj = 1. If /i = 0, then Rj intersects p. If 
h G -S"^,-!, then R'j intersects LE{Rh). It follows from ([58|) that on n D2 ^ D'^, the segment 
Rj intersects Therefore, Ci+i — 1 < /i, which completes the proof. □ 

Lemma 4.3. On G, the following hold: 

1. IfjeSn then \LE{Rj)\ -2w< Nj < \LE{Rj)\ + 2w. 

2. If j G 5^ n then Nj = 0. 

3. If j G J, then < Nj < \LE{Rj)\ + 2w. 

Proof. On G, the tree 7^ is a union of the paths Pi, . . . , "P^, all of which come from good walks. 
Therefore, this result is an immediate consequence of Proposition 13.101 once it is clear that the 
sets S and J are defined for the tree 7^ the same manner as they are defined in Proposition 13. 10] 
However, this can be seen from the definition, and the fact that Q^i — 1 G J for all i = 1, . . . , A; 
by Lemma 14.21 Additional indices are placed in J because of intersections between different 
paths, but this does not affect the result because the claims for j £ J are weaker than those for 
j G □ 

If 1 < z < J < /c, then there is a unique path in 7^ between the vertices Xi and xj. Let 
d'{xi,Xj) be the number of vertices in this path, unless the path goes through the root p, in 
which case let d'{xi, Xj) = 00. Note that for i = 1, . . . ,k, the vertex Xj initially gets the label Q, 
and on this label never changes. Therefore, the path in T* from Q to Q corresponds to the 
path in 7^ from Xi to Xj. Define d*{xi,Xj) to be the number of vertices of the path in the tree T* 
between Q and Q, unless this path goes through 0, in which case let d*{xi,Xj) = 00. The next 
proposition allows us to compare d'{xi,Xj) and d*{xi,Xj). 
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Proposition 4.4. Let Sij be the set of vertices on the path in T* from Q to Qj. On G, we have 



d'ixi,Xj)- < ^Nh. 



Proof. We claim that if h € J'^, then the following are equivalent: 

• There is a vertex labeled h on the path in 7^ from Xi to xj. 

• Every vertex labeled h is on the path in 7^, from Xi to xj. 

• We have h £ Sij. 

This claim immediately implies the proposition. 

To prove the claim, we first show that the first two statements are equivalent. The path in 
Tk from Xi to xj is made up of portions of the paths Vi, . . . ,Vk- The vertices labeled h, if there 
are any, are consecutive points along the path 'Pg(h) . Therefore, the only way that the path in 7^ 
from Xi to Xj can traverse some but not all of the vertices labeled h is if one of the other paths 
Vb with b > g{h) hits the path at one of the vertices labeled h. However, as observed in 

the proof of Lemma 14.21 o^i G this implies that I/j = 1 and therefore h £ J. 

It remains to show that the first two statements are equivalent to h £ Sij. However, this 
part of the claim is just an extension of Remark 13.111 Each of the paths Vi, . . . ,Vk comes from 
loop-erasing a good segment of a random walk, so by Remark 13.11^ the only difference between 
the set of labels of vertices in Vi and the set 5* n [0, Ci+i ~ 1] is that numbers in J could appear 
in one but not the other. Because the path in 7^ from Xi to Xj traverses portions of these paths, 
and the path in T* from Q to Q traverses the corresponding paths in T* , it follows that h £ 
then h £ Sij if and only if there is a vertex labeled h on the path in 7^ from Xi to Xj. □ 

4.3 Bound on the probability of coupling 

In this section, we show that it is possible to couple the trees T* and T* such that T* = T* 
with high probability. This will allow us to compare the distances between points in a uniform 
spanning tree on the complete graph and distances between points in a uniform spanning tree on 
the torus. We first state a lemma that we will use to bound the probability that our coupling is 
sucessful. 

Lemma 4.5. Let Yi, . . . ,Yn and Zi, . . . ,Zn be {0, l}-valued random variables. Letpi = P(Yi = 1) 
and Qi = P{Zi = 1). Let pij = P{Zi = Zj = 1) and qij = P{Yi = Yj = 1). Then there exists a 
coupling of (Yi, . . . , Yn) and {Zi, . . . , Z„) such that 

n n 

P{Yi ^ Zi for some z = 1, . . . , n) < ^ "^iPij + Qij) + Y\Pi- Qil- 

i=l jj^i i=l 



Proof Note that P{Yi = for all i) > I - YA=iPi and P{Zi = for all i) > I - YTi=i H- For 
i = 1, . . . , 71, we have P(Yi = 1 and Yj = Q for j ^ i) > Pi — ^j^iPij and P{Zi = 1 and Zj = 
for j ^ i) > Qi — Ylij^i lij- Therefore, the two sequences (Yi, . . . , Yn) and (Zi, . . . , Zn) can be 
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coupled so that 1^ = Zj = for all i with probability at least min{l — 1 — '^^=iQj} 

and, for each i, we have Yi = Zi = 1 and Yj = Zj = for j ^ i with probability at least 
mm{pi,qi} — Ylj^iiPij + With this coupling, if A is the event that Yi = Zi for all i, then 

l- n n >. n n 

P{A) > min S 1 - X]^'*' ^ ~ X] f ^ minfe. 9i} - X] X]^^*^' + 
I- j=i j=i ' 1=1 i=i 

n ra n 

> 1 - ^ max{pi, + ^ mm{pi,qi} - ^ ^(Pij + Qij) 

i=l 1=1 i=l jy^i 

n n 

= 'i--^\pi-qi\-Y^ ^{pij + Qij), 

i=l i=l jj^i 

which implies the lemma. □ 
Lemma 4.6. There exist constants C and C , depending on k, such that 

C 



P{i > (:7'(logn)2/22(loglogn)) < 



log n 



Proof. Because the time for a random walk on Gn,i3 to visit the root has a geometric distribution 
with mean Pn'^{logn)^/'^ , the distribution of Ui + ■ ■ ■ + Uk is stochastically dominated by the 
distribution of the sum of k independent random variables, which we call Zi, . . . , Z^,, each having 
the geometric distribution with mean /3n2(logn)^/2. If Z has the geometric distribution with 
mean m > 2, then 

P(Z > am) = 1 < 1 e-" < le'". 

\ m J \ m J 

Therefore, for sufficiently large n, 

P{Ui + • • • + C/fc > k(3n^{\ognf>'^{\og\ogn)) < kP{Zi > [5n^{\ognfl'^{\og\ogn)) 

9k 

< 2A:e-i°s'°s" = -— . (59) 
logn 

Because £ = '}2i=ii^ + L^j/'^J) = [n2(log n)^/^^] , the lemma follows. □ 

Proposition 4.7. For all > 0, the random variables Ii,j and lij can be coupled so that 

C 

Pilij = iij for all j <i andiG Sj-i U {0}) > 1 - 

for some constant C that depends on k. 

Remark 4.8. Suppose that /jj = lij for all j < i and i £ Sj-i U {0}. Then, by induction on j, 
it follows that Sj = Sj for all j < £. Therefore, by the construction of the trees T* and T*, we 
have T* = f*. Thus, Proposition O implies that P{T* = f*)>l- C(log n)-i/22+e. 
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Proof of Proposition \4. 7| Our strategy will be to couple the random variables lij and lij for 
fixed j, using Lemma 14.51 and then proceed by induction on j. We first consider the random 
variables lij conditional on Sj-i. Because the random walk on Km,a goes to p on each step with 
probability l/{ay/m), we have 

P(/o,, = l\Sj^i) = -4=. (60) 



a^/m 

Also, when the random walk on -fCm,a does not go to p, it visits a randomly chosen vertex. 
Therefore, if i G •S'^-i, then 

p(/„ = i|Vi) = (61) 



a\/m / m 



h,i E Sj-i, then Vh 7^ Vi. Because the random walk can only visit one vertex at a time, if 
h,i £ Sj-i and h ^ i, then 

P{h,j = = l\Sj^i) = 0. (62) 

For all j > 0, let J-j be the cr- field generated by the segments Ri for i < j. Note that the 
event i S Sj is in J^j. Since Iqj = 1 if and only if Rj intersects p, and on each step the random 
walk goes to p with probability l/(/?n^(log n)^/^), we have 



P(k, = = 1 - ( 1 - ^-.^j^] . (63) 



It follows from ([9]) that for all x G Z^, we have 



< J_2-L"'AnJ < 

log n 



for some constant C. If the walk R'^ started from a point that was uniformly distributed on Z^, 
then since A'j is a segment of length r — 2w, this walk would intersect a set U with probability 
Capj._2w{U) and would intersect both U and V with probability Closer-2w{U,V). Combining 
this observation with ([MI) and the definition of lij, we get that for i £ Sj-i, 

Pil.j = l|.F,_i) = f 1 - / ) (1 + i^i,,)Cap,_2^(Li?(i?,)), 
V pn"^(logn)^/^/ 

where the first term is P{Io.j = 0) and the error term \Eij\ is bounded by C/(logn). Since 
P{Ioj = 1) < r/(/3n2 (log n)V'2) < C/(logn)i/", we have 

P(/ij = l|.F,_i) = (1 + ii;,',,)Cap,_2^(Lii;(i?i)), (65) 

where the error term {E'i j\ is bounded by C/(log n)^/^^. Likewise, if i S Sj-i, then 

F(/„, = = i|^,_o < (1 - (1 - ^„.(i4„)v0 ')(' + ki^)c«P.-..(i^(«.)) W 



and if /i, i G •S'j-i with h ^ i, then 

logn J 



Pihj = = MJ'j-i) < ( 1 + r^) Closer-2v,{LE{Rh),LEiR^)). (67) 
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Let Wj be the event that 



J = min{z : j 7^ //^ j for some h £ Si-i U {0}}. 

If none of the events Wi, . . . , Wj-i occurs, then Sj-i = Sj-i. Therefore, by combining equations 
(jeOl) . (IHTI) . (f62]) . ([63]) . (165]) . (fMl) . and ([67]) and applying Lemma |431 we see that the random 
variables /j.j and lij can be coupled so that 



1 



+ E 

«e5j_i 
+ 11 + 



1 



a\ ra / m 



/9n2(log n)i/2 
1 



(l + S,',^.)Cap,,_2^(Lii;(i?.)) 



c 



1 



1 



1 



logny\^ \^ /3n2(logn)i/^ 
C 



Cap,_2^(L^(i?i)) 



+ X] (l + |^)ciose,_2»(L^;(i?h),L^(i?i))- 



(68) 



By ([3]), the first term on the right-hand side of ()68p is zero. To bound the third term, note 
that ii^[Cap^_2tt,(-Z^-E(-Ri))] < Cr^/(n^ logn) by Proposition 13. 71 so the expected value of the third 
term is at most 



(j-l) 1 + 



Cr2 ^ C(j 



logny \ /3n2 (logn) y n'^ logn ~ ^^(logn)'^/^ ~ (logn)^/^^ 



l)r3 ^ C(j-l) 



Likewise, using Proposition l3.3l the expected value of the fourth term is at most C(j — l)^(log n) s/22_ 
We now consider the second term. By Proposition 13.81 fo'^ ^ ^ we have 



P 



Cap,_2»(Li?(i?,)) 



(logn)2/ii 



> 



< 



C 



(logn)^/22y (logn)^/22 ^ 



(69) 



Let y be the event that \Capr_2^{LE{Ri)) - a„(logn)-2/ii| < (logn)-5/22 for ah i < £, and 
let Vj be the event that this inequality holds for all i < j — 1. By (j55p . we have l/(m + 1) < 
a„/(log n)2/^^ < 1/m, so |l/m — a„/(logn)2/^^| < l/m^ < C/(log n)'^/^^. Therefore, on I^-, we 
have \l/m-Ca.p^_2^{LE{Ri))\ < C7/(log n)5/22 for all i G S^j^i. Also, l/(am3/2) < C7/(logn)3/ii 
and, since \Elj\ < C/{logn)^/^\ we have \EljCap^_2^{LE{Ri))\ < C/(logn)3/ii ^j. ^^^j ■ ^ 
on Vj-. Therefore, on Vj, the second term on the right-hand side of (j68p is at most Cj/(log n)^/22^ 
Taking expectations of both sides in (j68p . using that Vj E and combining these bounds, 

we get 



P(w^j nv)< P{Wj n F,) = E[P{Wj\Tj -i)iv^] < 



Cj 



+ 



cf 



(70) 



(logn)^/22 (logn)^/22 

Let K = C"(log n)2/22(log log n), where C is the constant from Lemma 14.61 Then, using ()69p . 
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([To]) , and Lemma \TM we get 

p([jw,) <p([j{w,nv)\+p{v') 
< > K) + ^ p{Wj nv) + ^p 

j=i i=i 



Cap,„2^(Li?(i?,)) 



(logn)2/ii 



> 



(log n)^/22 



C Ci^2 (27^3 C7(loglogn) 
< 1 ■ \ ■ \ ■ < ^ & & ^ 



logn (logn)^/22 (logn)8/22 (log n)3/22-6' (logn)i/22-^' ■ 
Since this result holds for all 6 > 0, the proposition follows. □ 

4.4 Distances in the uniform spanning tree 

Recall from the introduction that yi, . . . ,yk are k points picked uniformly at random from Km, 
and T is a uniform spanning tree on Km- The distance d{yi, yj) is the number of vertices on the 
path between yi and yj in T, and the distance d*{yi, yj) is the length of the path between 7/j and 
yj in the tree T* , unless the path goes through p in which case d*{yi,yj) = oo. Likewise, recall 
that points chosen uniformly at random from Z^. The distance d{xi,Xj) is the 

number of vertices on the path between Xi and xj in the uniform spanning tree T on Z^. Also, 
d'{xi,Xj) is the length of the path from Xi to Xj in the tree 7^, while d*{xi,Xj) is the length of 
the path from Q to ("j in the tree T*. Both d'{xi,Xj) and d*{xi,Xj) are set to oo when the path 
goes through the root. Peres and Revelle proved the following result in section 7 of [15]. 

Lemma 4.9. The total variation distance between the joint distribution of the (2) distances 
(d(xj, Xj))i<j<j<fc and the joint distribution of the distances ((i'(xj, Xj))i<j<j<fc is at most the 
probability that for some i,j < k, the path in from Xi to xj goes through the root p. Likewise, 
the total variation distance between the distributions of {d{yi,yj))i<_i^j<_k one? {d* {yi,yj))i<i<:j<:k 
is at most the probability that for some i,j < k, the path in T* from y^ to yj goes through p. 

The argument of Peres and Revelle relies on a form of stochastic domination of spanning 
forests by spanning trees, which follows from a result of Feder and Mihail [6]. Note that Tk is a 
portion of a uniform spanning tree on G^fi (the rest of the tree can be constructed by continuing 
with Wilson's algorithm), and when edges in this spanning tree leading to the root p are removed, 
the graph that remains is a random spanning forest on Z^. See section 7 of |15] for details. 

Because the complete graph satisfies the conditions of the graphs studied by Peres and Revelle, 
the following Lemma follows immediately from Lemma 7.2 of [15J. 

Lemma 4.10. Let e > 0. For sufficiently large a, there exists an M such that if m > M then 
the probability that for some i,j < k, the path in T* from yi to yj goes through p is less than e. 

The next lemma bounds the number of segments that are involved in intersections. Recall 
that J = {j < i ■ lij = 1 or Jj j = 1 for some <i < £}. Let \J\ denote the cardinality of J. 

Lemma 4.11. There exists a constant C depending on k such that 

P(|J|>(logn)V22)<C^(l°gl°g-)' 



(logn)^/' 



22 
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Proof. As in the proof of Proposition l4.71 let K = C"(log n)^/^^(log log n), where C is the constant 
from Lemma [4. 61 By Lemma [4. 61 we have P{£ > K) < C/(logn). By taking expectations in ()65p 
and using Proposition 13.71 we get 



Pikj = 1) < 



Cr2 



n'^ log n 

for all 1 < i < j. By ([MD, P{Io,j = 1) < Cr/ (n^ (log n)V2) for ah j. Therefore, for all i < K, 



CKr^ 

P{Iij = 1 or /j- i = 1 for some < j < K) < — + 



Cr 



< 



C log log n 



logn (log n) (logn)^/ 



22 



It follows that 



i^P n {!,..., i^}|]<g^^^<C(log log n)^. 



(log nYl'^'^ 



Using Markov's Inequality, 



P[\J\ > (log n) V22) < p(|j n {1, . . . , K}\ > (log n)V22) + > ^) < ^^l^^^log^ ^ 

(logn)^/^^ logn 



and the lemma follows. 



□ 



Lemma 4.12. There exists a constant C depending on k and a sequence of constants (7n)^i 
with < inf7„ < sup7„ < oo such that outside of an event whose probability is at most 
C(loglogn)^(logn)~-'^/^^, we have 



d (^XiyXj^ d (^X{^Xj^ 



^ 7nn2(log n)i/6 



< 



C 



(log n)i/22 



Proof. By Proposition 14.41 on G we have 

d'{xi,xj)- <Y.^h. 



(71) 



By recalling that d*{xi,Xj) = \Sij\ = \J Ci Sij \ + | J'^ n Sij\ and splitting the sum into h & J and 
h £ J^, we get 



6„n2(logn)5/66c?*(x,,x,)- 



heSi,j 

<6„n2(logn)5/66|j|+^Ar^+ ^ \Nh - bnn\lognf/''^\ 



(72) 



where {bn)'^=i is the sequence of constants defined in Corollary 13.21 with L = r. Choose 7^ such 
that ^/m6„n^(logn)^/^^ = 7^/1^ (log n)^/^. As m = [a~^(logn)^/-'^iJ , we get 7„ = 5„an"^^^(l — 6n), 
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where < 5„ < C(logn) ^/^""^ for some constant C. The fact that < inf7„ < sup7„ < oo now 
follows from Corollary 13.21 and Proposition 13. 8[ Combining ([7T|) and (j72]) , we get 



7„n2(log n)^/^ 



< 



(logn)5/66 



+ 



h£Sij 



< 



7„n2(logn)V6 7„n2(logn)V6 
1^1 



'„n2(logn)V6 ^^'^ + ^ 
^ Yl \Nh-bnnHlognf/'^\ 



7nn^(log n)^/^ 



(73) 



heJ'^nSi. 



It remains to show that the three terms on the right-hand side of (j73p are small. 

Let H be the event that \LE{Rj) - bnn^{\ognf/^^\ < C'n^{\ogn)-^^/^'^ for ah j < I, where 
C is the constant from Corollary 13.21 Then Corollary 13.21 and Lemma 14.61 imply 



P^H") < P{e > K) + 



CK 



< 



C(log logn) 



(logn)4/22 - (logn) 1/11 • 
Combining this observation with Proposition 14.11 and Lemmas 14.61 and 14.111 we get 

C(log log nf' 



P{Gr\Hr\{\J\ < {logny^^^}n{£ < K}) > 1 



(log n)^/22 



By Lemma 14.31 on this event we have 
2 



7„n2(logn)^/^ 

and 

Also, again using Lemma 14.31 
1 



h&j 



< 



C\J\ 



n2(log n)^/^ 



• n 



'(logn)^/^^ < 



C 



(log n)^/22 



< 



C 



m (logn)^/22 



7„n2(log n)^/^ 



\Nh-bnn\\ognf/''^\< 



CI 



C'n 



'^2 



< 



n2(logn)V6 y^"^ + (log 71)15/44) 

C(loglogn) C(loglogn) 
(logn)V6-i/ii-(? " (logn)5/66-e- 



The result follows from (1731) and these bounds. 



□ 



Proof of Theorem \l.l[ By Aldous' result ([T]), the joint distribution of d{yi,yj)/^/m for 1 < i < 
j < k converges to /i^ as m — > 00. Therefore, given e > 0, it suffices to show that for sufficiently 
large n, the total variation distance between the joint distribution of these distances and the joint 
distribution of the distances (i(xj, Xj)/(7„n-^(log n)i/^) is less than e. 
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By ([3]) Lemma I4.10| we can choose a and /? large enough that for sufficiently large m, the 
probability that for some i, j < k the path in T* from 7/j to yj goes through p is less than e. There- 
fore, by Lemma the total variation distance between the distributions of {d{yi,yj))i<i<:j<k 
and {d*{yi,yj))i<i<:j<k is less than e for sufficiently large m, and thus for sufficiently large n. 

Suppose T* = T*. If for all i,j < k, the path in T* from yi to yj does not go through p, 
then for all i,j < k, the path in T* from to Cj does not go through p. However, on G this 
happens if and only if for all i,j < k, the path in 7^ from Xj to xj does not go through p. In 
view of Proposition 14. II and Remark 14.81 it follows that for sufficiently large n, the total variation 
distance between the distributions of {d{xi,Xj))i<i<^j<k and {d' {xi, Xj))i<i<^j<k is less than e. 

For i,j < k, we have d*{xi,Xj) = d*{yi,yj) on the event T* = T*. Thus, for suffi- 
ciently large n, the total variation distance between the distributions of ('^(yi, yj))i<i<j<fc and 
{d* {xi, Xj))i<:i^j<k is less than e. The result follows from these observations and Lemma [4. 121 □ 
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